BriefGPT.xyz
Feb, 2023
The Wisdom of Hindsight Makes Language Models Better Instruction Followers
Tianjun Zhang, Fangchen Liu, Justin Wong, Pieter Abbeel, Joseph E. Gonzalez
TL;DR
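This paper proposes HIR, a new algorithm based on hindsight instruction relabeling, which trains the model to align better with instructions and so addresses the instruction-alignment problem in language models. On 12 challenging BigBench reasoning tasks, HIR outperforms the baseline algorithms and even surpasses supervised fine-tuning.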
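To make the idea of hindsight relabeling concrete, here is a minimal toy sketch in Python. It is not the authors' implementation: the `Episode` type, the two instruction strings, and the `is_correct` checker are all hypothetical names introduced for illustration. The assumption is that after the model produces an output, the instruction is rewritten to describe what the output actually achieved, so every sampled output becomes a valid supervised example.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Episode:
    """One sampled interaction: the instruction given, the query, the model output."""
    instruction: str
    query: str
    output: str


# Hypothetical instruction strings, chosen only for this illustration.
CORRECT = "Give a correct answer."
WRONG = "Give a wrong answer."


def relabel_in_hindsight(
    episode: Episode, is_correct: Callable[[str, str], bool]
) -> Episode:
    """Replace the instruction with one that the output actually satisfies."""
    achieved = CORRECT if is_correct(episode.query, episode.output) else WRONG
    return Episode(achieved, episode.query, episode.output)


def build_training_pairs(
    episodes: List[Episode], is_correct: Callable[[str, str], bool]
) -> List[Tuple[str, str]]:
    """Turn sampled episodes into (prompt, target) pairs for supervised training."""
    pairs = []
    for ep in episodes:
        relabeled = relabel_in_hindsight(ep, is_correct)
        prompt = f"{relabeled.instruction}\n{relabeled.query}"
        pairs.append((prompt, relabeled.output))
    return pairs
```

In this sketch a wrong output is not discarded: it is kept as a valid example of following the relabeled "wrong answer" instruction, which is the intuition behind learning from hindsight.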
Abstract
Reinforcement learning has seen wide success in finetuning large language models to better align with instructions via human feedback. The