Integrating Self-Supervised Speech Model with Pseudo Word-Level Targets from Visually-Grounded Speech Model

التفاصيل البيبلوغرافية
العنوان: Integrating Self-Supervised Speech Model with Pseudo Word-Level Targets from Visually-Grounded Speech Model
المؤلفون: Fang, Hung-Chieh, Ye, Nai-Xuan, Shih, Yi-Jen, Peng, Puyuan, Wang, Hsuan-Fu, Berry, Layne, Lee, Hung-Yi, Harwath, David
المصدر: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW) Acoustics, Speech, and Signal Processing Workshops (ICASSPW), 2024 IEEE International Conference on. :645-649 Apr, 2024
Relation: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)
قاعدة البيانات: IEEE Xplore Digital Library
الوصف
ردمك:9798350374513
DOI:10.1109/ICASSPW62465.2024.10625802