Sep, 2020
HiFiSinger: Towards High-Fidelity Neural Singing Voice Synthesis
Jiawei Chen, Xu Tan, Jian Luan, Tao Qin, Tie-Yan Liu
TL;DR
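This work proposes HiFiSinger, a singing voice synthesis (SVS) system that combines a FastSpeech-based acoustic model with a Parallel WaveGAN-based vocoder and applies multi-level adversarial training over the time and frequency structure of the audio to synthesize high-fidelity singing voices at high sampling rates.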
Abstract
High-fidelity singing voices usually require higher sampling rate (e.g., 48kHz) to convey expression and emotion. However, higher sampling rate
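To put the 48kHz figure in concrete terms, the short sketch below computes the highest representable frequency (the Nyquist limit) and the number of waveform samples for a clip at two sampling rates. This is illustrative arithmetic only: the 24kHz comparison rate, the 5-second clip length, and the helper names `nyquist_hz` and `num_samples` are assumptions made for this example, not values or code from the paper.

```python
# Illustrative arithmetic only. The 24 kHz baseline and the 5 s clip length
# are assumed values chosen for comparison, not taken from the paper.


def nyquist_hz(sample_rate_hz: int) -> float:
    """Highest frequency representable at the given sampling rate."""
    return sample_rate_hz / 2


def num_samples(sample_rate_hz: int, duration_s: float) -> int:
    """Number of waveform samples needed for a clip of the given duration."""
    return int(round(sample_rate_hz * duration_s))


if __name__ == "__main__":
    clip_seconds = 5.0  # hypothetical clip length
    for rate in (24_000, 48_000):
        print(
            f"{rate} Hz: band up to {nyquist_hz(rate):.0f} Hz, "
            f"{num_samples(rate, clip_seconds)} samples for {clip_seconds} s"
        )
```

Doubling the sampling rate doubles both the frequency band a model has to cover and the length of the waveform sequence it has to generate for the same clip.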