自动语音识别的单调分段注意力

Oct, 2022

Monotonic segmental attention for automatic speech recognition

Albert Zeyer, Robin Schmitt, Wei Zhou, Ralf Schlüter, Hermann Ney

TL;DR提出了一种新颖的分段-关注模型用于自动语音识别,使用分段关注避免全局关注的二次运行时间，更好地控制长序列，最终实现流式处理。

Abstract

We introduce a novel segmental-attention model for automatic speech recognition. We restrict the decoder attention to segments to avoid quadratic runtime of global attention, better generalize to long sequences,