BriefGPT.xyz
Nov, 2019
长篇语音识别的端到端模型比较
A comparison of end-to-end models for long-form speech recognition
HTML
PDF
Chung-Cheng Chiu, Wei Han, Yu Zhang, Ruoming Pang, Sergey Kishchenko...
TL;DR
本研究调查和提高端到端模型在长篇转录上的性能。实验比较了不同的端到端模型并证明RNN-T模型在这种场景下比注意力模型更加鲁棒,并且使用限制注意力单调性和分段解码算法等两种改进方法,将注意力模型的性能极大提升,达到了和RNN-T模型相当的水平。
Abstract
End-to-end
automatic speech recognition
(ASR) models, including both
attention-based models
and the
recurrent neural network transducer
(R
→