BriefGPT.xyz
May, 2020
无需更多数据:通过文本到语音数据增强来提高端到端语音识别
You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation
HTML
PDF
Aleksandr Laptev, Roman Korostik, Aleksey Svischev, Andrei Andrusenko, Ivan Medennikov...
TL;DR
采用数据增强和TTS技术,对ASR的训练数据进行扩充,并通过集成语言模型,在LibriSpeech数据上建立end-to-end模型,相对于半监督技术的效果更好。
Abstract
data augmentation
is one of the most effective ways to make end-to-end
automatic speech recognition
(ASR) perform close to the conventional hybrid approach, especially when dealing with
→