ET-GAN: Cross-Language Emotion Transfer Based on Cycle-Consistent Generative Adversarial Networks

التفاصيل البيبلوغرافية
العنوان: ET-GAN: Cross-Language Emotion Transfer Based on Cycle-Consistent Generative Adversarial Networks
المؤلفون: Jia, Xiaoqi, Tai, Jianwei, Zhou, Hang, Li, Yakai, Zhang, Weijuan, Du, Haichao, Huang, Qingjia
سنة النشر: 2019
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Sound, Computer Science - Computation and Language, Electrical Engineering and Systems Science - Audio and Speech Processing
الوصف: Despite the remarkable progress made in synthesizing emotional speech from text, it is still challenging to provide emotion information to existing speech segments. Previous methods mainly rely on parallel data, and few works have studied the generalization ability for one model to transfer emotion information across different languages. To cope with such problems, we propose an emotion transfer system named ET-GAN, for learning language-independent emotion transfer from one emotion to another without parallel training samples. Based on cycle-consistent generative adversarial network, our method ensures the transfer of only emotion information across speeches with simple loss designs. Besides, we introduce an approach for migrating emotion information across different languages by using transfer learning. The experiment results show that our method can efficiently generate high-quality emotional speech for any given emotion category, without aligned speech pairs.
Comment: Accepted by ECAI 2020, 8 pages, 4 figures
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/1905.11173
رقم الأكسشن: edsarx.1905.11173
قاعدة البيانات: arXiv