Nov, 2024
CTC-Assisted LLM-Based Contextual ASR
Guanrou Yang, Ziyang Ma, Zhifu Gao, Shiliang Zhang, Xie Chen
TL;DR
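This work addresses the limitations of existing automatic speech recognition systems in recognizing rare words. We propose a CTC-assisted contextual ASR model that uses an effective filtering algorithm to improve recognition accuracy for rare, long-tail words. Experiments show that the model significantly improves recognition performance on the LibriSpeech test sets compared with the baseline model and other related work, demonstrating strong potential impact.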
Abstract
Contextual ASR or hotword customization holds substantial practical value. Despite the impressive performance of current end-to-end (E2E) automatic speech recognition (ASR) systems, they often face challenges in accurately recognizing