BriefGPT.xyz
Dec, 2017
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Jonathan Shen, Ruoming Pang, Ron J. Weiss, Mike Schuster, Navdeep Jaitly...
TL;DR
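This paper describes Tacotron 2, a neural network architecture that synthesizes speech directly from text. The system consists of a recurrent sequence-to-sequence feature prediction network and a modified WaveNet model, and it achieves a mean opinion score (MOS) comparable to that of professionally recorded speech.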
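For readers unfamiliar with the metric, a MOS is conventionally the arithmetic mean of listener ratings on a 1 to 5 scale. The sketch below is a minimal illustration under that assumption; the function name `mean_opinion_score` and the sample ratings are hypothetical and are not taken from the paper.

```python
from statistics import mean


def mean_opinion_score(ratings):
    """Return the arithmetic mean of listener ratings on a 1-5 scale."""
    if not ratings:
        raise ValueError("ratings must not be empty")
    for r in ratings:
        # Reject values outside the conventional 1-5 rating scale.
        if not 1 <= r <= 5:
            raise ValueError(f"rating out of range: {r}")
    return mean(ratings)


# Hypothetical ratings, invented for illustration only (not results from the paper).
example_ratings = [4.5, 4.0, 5.0, 4.5]
print(mean_opinion_score(example_ratings))
```

Comparing two systems this way is only meaningful when both are rated by the same listener pool under the same conditions.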
Abstract
This paper describes Tacotron 2, a neural network architecture for speech synthesis directly from text. The system is composed of a recurr