BriefGPT.xyz
Feb, 2024
小型语言模型能为较大语言模型选择调整训练数据
Smaller Language Models are capable of selecting Instruction-Tuning Training Data for Larger Language Models
HTML
PDF
Dheeraj Mekala, Alex Nguyen, Jingbo Shang
TL;DR
通过基于样本学习百分比的训练数据选择,我们展示了当前语言模型具备自主选择高质量训练数据的能力,这极大地降低了训练成本且达到或超过整个数据集训练的性能表现。
Abstract
instruction-tuning
language models
has become a crucial step in aligning them for general use. Typically, this process involves extensive training on large datasets, incurring high training costs. In this paper,
→