BriefGPT.xyz
Jun, 2024
朝向无需发音模型的无监督语音识别
Towards Unsupervised Speech Recognition Without Pronunciation Models
HTML
PDF
Junrui Ni, Liming Wang, Yang Zhang, Kaizhi Qian, Heting Gao...
TL;DR
本研究采用不依赖音素词典的新方法,通过仅包含高频英语词汇的语料库,在没有配对语音和文字数据的情况下,实现了近20%的词错误率,并证明了基于联合语音到语音和文本到文本的标记填充技术,使得无监督语音识别系统的性能超过了直接分布匹配方法。
Abstract
Recent advancements in
supervised automatic speech recognition
(ASR) have achieved remarkable performance, largely due to the growing availability of large transcribed speech corpora. However, most languages lack sufficient
→