BriefGPT.xyz
Jan, 2023
Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining
Takaaki Saeki, Soumi Maiti, Xinjian Li, Shinji Watanabe, Shinnosuke Takamichi...
TL;DR
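Using zero-shot learning and a multilingual language model, this study proposes a multilingual text-to-speech (TTS) method that needs only text data in the target language. It can build TTS systems for low-resource languages that have only text resources, greatly extending TTS language coverage while achieving high intelligibility.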
Abstract
While neural text-to-speech (TTS) has achieved human-like natural synthetic speech, multilingual TTS systems are limited to resource-rich languages due to the need for paired text and studio-quality audio data. T...