Zhanhui Zhou, Zhixuan Liu, Jie Liu, Zhichen Dong, Chao Yang...
TL;DR大规模语言模型通过使用弱到强的搜索方法进行调整,以增强模型的效果并提高模型的对齐能力。
Abstract
large language models are usually fine-tuned to align with human preferences.
However, fine-tuning a large language model can be challenging. In this work,
we introduce $\textit{weak-to-strong search}$, framing t