BriefGPT.xyz
May, 2024
揭示基于LLM的中文开源数据集上的ASR潜力
Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets
HTML
PDF
Xuelong Geng, Tianyi Xu, Kun Wei, Bingsheng Mu, Hongfei Xue...
TL;DR
基于大型语言模型的自动语音识别研究,探索了多种配置下的语音编码器、语言模型和投影模块对ASR性能的影响,采用三阶段训练方法实现了在中文数据集上的最佳表现,为未来LLM基于ASR系统的研究提供了实证基础和性能优化的见解。
Abstract
large language models
have demonstrated unparalleled effectiveness in various NLP tasks, and integrating LLMs with
automatic speech recognition
is becoming a mainstream paradigm. Building upon this momentum, our
→