BriefGPT.xyz
Jul, 2019
基于端到端自动语音识别的音素与字形表示分析
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition
HTML
PDF
Yonatan Belinkov, Ahmed Ali, James Glass
TL;DR
本文分析了自动语音识别中使用的端到端神经网络模型的内部表示学习,对音素和字母、不同发音特征进行了比较,并发现不同特征在深度神经网络的不同层中的表示具有明显的一致性。
Abstract
End-to-end
neural network
systems for
automatic speech recognition
(ASR) are trained from acoustic features to text transcriptions. In contrast to modular ASR systems, which contain separately-trained components
→