语音识别中零-shot领域调适的大型语言模型启发

Jun, 2023

语音识别中零-shot领域调适的大型语言模型启发

Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition

Yuang Li, Yu Wu, Jinyu Li, Shujie Liu

TL;DR本文介绍了两种使用LLaMA的零样本ASR领域适应方法，这两种方法可以通过一个领域特定的文本提示有效地减少跨领域TedLium-2和SPGISpeech数据集上的词错误率（WER），特别是，深度LLM-fusion具有更好的实体召回和词汇外单词的召回优势。

Abstract

The integration of language models (LMs) has proven to be an effective way to address domain shifts in speech recognition. However, these approaches usually require a significant amount of target domain text data for the training of LMs. Different from these methods, in this work, with