BriefGPT.xyz
Jan, 2024
Human-Instruction-Free LLM Self-Alignment with Limited Samples
Hongyi Guo, Yuanshun Yao, Wei Shen, Jiaheng Wei, Xiaoying Zhang...
TL;DR
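We study how to automatically align large language models when only limited samples are available, using in-context learning examples and an iterative tuning algorithm to achieve self-generalizing alignment with almost no human supervision.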
Abstract
Aligning large language models (LLMs) with human values is a vital task for LLM practitioners. Current alignment techniques have several limitations: (1) requiring a large amount of annotated data; (2) demanding