BriefGPT.xyz
Sep, 2022
增强注意力机制的Citrinet语音识别模型
Attention Enhanced Citrinet for Speech Recognition
HTML
PDF
Xianchao Wu
TL;DR
本文介绍了一种基于卷积神经网络的语音识别模型Citrinet,利用多头注意力机制提高了模型的收敛速度并降低了字符错误率。实验结果表明,该模型在日语CSJ-500h和Magic-1600h数据集上的表现优于现有模型。
Abstract
citrinet
is an end-to-end convolutional Connectionist Temporal Classification (CTC) based
automatic speech recognition
(ASR) model. To capture local and global contextual information, 1D time-channel separable co
→