BriefGPT.xyz
Jan, 2024
大型语言模型的自动学习方法
Active Learning for NLP with Large Language Models
HTML
PDF
Xuesong Wang
TL;DR
使用大型语言模型(GPT-3.5和GPT-4)进行标注,研究了主动学习中减少标注成本和采样效率的方法。采用混合注释策略,将可能标注错误的样本与人工注释相结合,可以在AG新闻和腐烂的番茄等数据集上取得与人工注释相似甚至更好的结果,证明了大型语言模型在主动学习中的准确性和成本效益。
Abstract
human annotation
of training samples is expensive, laborious, and sometimes challenging, especially for Natural Language Processing (NLP) tasks. To reduce the labeling cost and enhance the
sample efficiency
,
→