BriefGPT.xyz
Sep, 2017
AISHELL-1: 一个开放源代码的汉语语音数据集与基准语音识别系统
AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline
HTML
PDF
Hui Bu, Jiayu Du, Xingyu Na, Bengu Wu, Hao Zheng
TL;DR
发布了名为AISHELL-1的开源普通话语音语料库,是目前适用于进行普通话语音识别研究和构建普通话语音识别系统的最大语料库,实验结果表明音频录制和转录的质量是有前途的。
Abstract
An open-source
mandarin speech corpus
called
aishell-1
is released. It is by far the largest corpus which is suitable for conducting the
speech r
→