BriefGPT.xyz
Apr, 2022
大规模流式端到端语音翻译基于神经转录器
Large-Scale Streaming End-to-End Speech Translation with Neural Transducers
HTML
PDF
Jian Xue, Peidong Wang, Jinyu Li, Matt Post, Yashesh Gaur
TL;DR
本文介绍了如何将神经转录器引入流式端到端语音翻译(ST)中,提出了基于注意力池化的Transformer transducer(TT)模型以及在多语言ST中的应用,结果表明TT模型不仅显著减少了推理时间,而且在英德翻译上优于基于ASR和MT的非流式级联ST。
Abstract
neural transducers
have been widely used in automatic speech recognition (ASR). In this paper, we introduce it to streaming
end-to-end speech translation
(ST), which aims to convert audio signals to texts in othe
→