Improving End-to-end Speech Recognition with Pronunciation-assisted Sub-word Modeling

Nov, 2018

Improving End-to-end Speech Recognition with Pronunciation-assisted Sub-word Modeling

Hainan Xu, Shuoyang Ding, Shinji Watanabe

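TL;DR: This paper proposes pronunciation-assisted sub-word modeling (PASM), a method that uses the pronunciation information of words to extract sub-words. Experiments show that it improves speech recognition accuracy more than both a character-based baseline and the commonly used byte-pair encoding (BPE) method.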

Abstract

In recent years, end-to-end models have become popular for application in automatic speech recognition. Compared to hybrid approaches, which perform the phone-sequence to word conversion based on a lexicon, an