Apr, 2024

MM-TTS: 多模态、情绪感应文本转语音综合的统一框架

TL;DRMultimodal Emotional Text-to-Speech System (MM-TTS) is proposed, which leverages emotional cues from multiple modalities, addresses the limitations of current approaches in capturing human emotions, and achieves superior performance compared to traditional Emotional Text-to-Speech models.