Feb, 2022
Knowledge Transfer from Large-scale Pretrained Language Models to End-to-end Speech Recognizers
Yotaro Kubo, Shigeki Karita, Michiel Bacchiani
TL;DR
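This paper proposes a method that eases the need for costly transcribed training data by drawing on the semantic knowledge held in the embedding vectors of large-scale language models. It extends both attention-based decoders and neural transducer decoders to reduce error rates.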
Abstract
End-to-end speech recognition is a promising technology for enabling compact automatic speech recognition (ASR) systems since it can unify the acoustic and language model into a single …