BriefGPT.xyz
Feb, 2024
指令调优的局限性
A Closer Look at the Limitations of Instruction Tuning
HTML
PDF
Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Ramaneswaran S, Deepali Aneja...
TL;DR
在本研究中,通过对LLMs进行严格实验和深入分析,我们发现Instruction Tuning的各种限制,比如IT无法增强LLMs的知识或技能、从知识来源中复制响应模式会导致响应质量下降、全参数微调会增加虚构错误等。同时,我们的研究还表明,仅从预训练知识中生成的响应始终优于通过IT学习任何形式的新知识的模型生成的响应。
Abstract
instruction tuning
(IT), the process of training
large language models
(LLMs) using instruction-response pairs, has emerged as the predominant method for transforming base pre-trained LLMs into open-domain conver
→