BriefGPT.xyz
Jul, 2023
Sumformer: A Linear-Complexity Alternative to Self-Attention for Speech Recognition
Titouan Parcollet, Rogier van Dalen, Shucong Zhang, Sourav Bhattacharya
TL;DR
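This paper proposes Summary Mixing, an alternative to self-attention that summarizes an entire utterance with the mean vector over its time steps. Introduced into state-of-the-art speech recognition models, the method reduces training and inference time by up to 27% and halves the memory budget.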
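The idea of summarizing an utterance with a mean over time steps can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the function name `summary_mixing` is hypothetical, the summary is combined with each frame by simple concatenation, and any learned transformations the actual model may apply are omitted. What it does show is that the summary is computed in a single pass, so the cost grows linearly with the number of time steps.

```python
def summary_mixing(frames):
    """Illustrative sketch: attach a mean-over-time summary to every frame.

    frames: list of T feature vectors (lists of floats) of equal dimension.
    Assumption: frame and summary are combined by concatenation.
    """
    if not frames:
        return []
    num_steps = len(frames)
    dim = len(frames[0])
    # Single pass over the utterance: average each feature across time steps.
    summary = [sum(frame[d] for frame in frames) / num_steps for d in range(dim)]
    # Every time step keeps its own features and receives the shared summary.
    return [list(frame) + summary for frame in frames]


frames = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
mixed = summary_mixing(frames)
```

Unlike self-attention, no pairwise comparison between time steps is made here; each frame only sees one shared vector for the whole utterance.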
Abstract
Modern speech recognition systems rely on self-attention. Unfortunately, token mixing with self-attention takes quadratic time in the leng