BriefGPT.xyz
Jun, 2024
大型语言模型的分阶段指导微调
Phased Instruction Fine-Tuning for Large Language Models
HTML
PDF
Wei Pang, Chuan Zhou, Xiao-Hua Zhou, Xiaojie Wang
TL;DR
通过渐进对齐的假设,我们提出了一种新颖的分阶段指令微调(Phased IFT)方法,基于难度评分并使用逐步训练的方式显著地提高了预训练语言模型的指令遵循能力。
Abstract
instruction fine-tuning
, a method enhancing
pre-trained language models
' capabilities from mere next-word prediction to complex instruction following, often employs a one-off training approach on diverse instruct
→