BriefGPT.xyz
Sep, 2023
高保真度语音合成的最小监督方法:全部使用扩散模型
High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models
HTML
PDF
Chunyu Qiang, Hao Li, Yixin Tian, Yi Zhao, Ying Zhang...
TL;DR
我们提出了一种基于扩散模型的最小监督高保真语音合成方法,其中所有模块均基于扩散模型构建,非自回归框架增强了可控性,持续时间扩散模型实现了多样化的韵律表达。
Abstract
text-to-speech
(TTS) methods have shown promising results in voice cloning, but they require a large number of labeled text-speech pairs.
minimally-supervised speech synthesis
decouples TTS by combining two types
→