BriefGPT.xyz
Jun, 2024
大型语言模型人类偏好学习综述
A Survey on Human Preference Learning for Large Language Models
HTML
PDF
Ruili Jiang, Kehai Chen, Xuefeng Bai, Zhixuan He, Juntao Li...
TL;DR
本综述从以偏好为中心的角度回顾了探索大型语言模型(LLMs)的人类偏好学习的进展,包括偏好反馈的来源和格式,偏好信号的建模和使用,以及对齐LLMs的评估。
Abstract
The recent surge of versatile
large language models
(LLMs) largely depends on aligning increasingly capable foundation models with human intentions by
preference learning
, enhancing LLMs with excellent applicabil
→