BriefGPT.xyz
Dec, 2023
One Shot Learning as Instruction Data Prospector for Large Language Models
Yunshui Li, Binyuan Hui, Xiaobo Xia, Jiaxi Yang, Min Yang...
TL;DR
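The Nuggets method selects high-quality training data for instruction tuning to improve the performance of large language models. On two benchmarks, training on the top 1% of samples selected by Nuggets outperforms conventional approaches that use the full dataset, underscoring that a data selection paradigm that prioritizes quality can align large language models with humans more efficiently.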
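As a rough illustration of the "top 1%" selection step mentioned in the TL;DR, the sketch below ranks examples by a per-example quality score and keeps the highest-scoring fraction. The function name `select_top_fraction` and the precomputed `scores` list are assumptions made for illustration; this excerpt does not describe how the paper computes its scores, so the scoring itself is left out.

```python
def select_top_fraction(examples, scores, fraction=0.01):
    """Return the highest-scoring `fraction` of examples (at least one).

    `scores` is a hypothetical list of per-example quality scores,
    assumed to be computed elsewhere; higher means better.
    """
    if len(examples) != len(scores):
        raise ValueError("examples and scores must have the same length")
    if not examples:
        return []
    # Number of examples to keep, rounded to the nearest integer, minimum 1.
    k = max(1, round(len(examples) * fraction))
    # Sort by score only, best first, then keep the first k examples.
    ranked = sorted(zip(scores, examples), key=lambda pair: pair[0], reverse=True)
    return [example for _, example in ranked[:k]]
```

For a pool of 1,000 scored examples, the default `fraction=0.01` keeps 10 of them for instruction tuning.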
Abstract
Aligning large language models (LLMs) with humans is a critical step in effectively utilizing their pre-trained capabilities across a wide array of language tasks. Current instruction tuning practices often rely on …