BriefGPT.xyz
Jan, 2024
An Empirical Analysis of In-context Learning Abilities of LLMs for MT
Pranjal A. Chitale, Jay Gala, Varun Gumma, Mitesh M. Khapra, Raj Dabre
TL;DR
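This work examines the in-context learning abilities of large language models and studies how different aspects of in-context demonstrations affect machine translation. Different model families behave differently when given perturbed examples, which suggests that the robustness of in-context learning may depend on several factors. Further research is needed to understand these factors fully.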
Abstract
In-context learning (ICL) has consistently demonstrated superior performance over zero-shot performance in large language models (LLMs). However, the understanding of the dynamics of ICL and the aspects that infl