
Feb 2024

Towards Efficient Active Learning in NLP via Pretrained Representations

Artem Vysogorets, Achintya Gopal

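TL;DR: Using the representations of a pretrained large language model inside the active learning loop, and fine-tuning on the labeled data only once the desired amount has been acquired, achieves performance similar to that of a fully fine-tuned model at a lower computational cost.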
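The loop described in the TL;DR can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: synthetic Gaussian vectors stand in for frozen LLM representations, a small logistic-regression head stands in for the cheap model retrained each round, and entropy-based uncertainty sampling is assumed as the acquisition rule. All function names and parameters (`fit_head`, `active_learning_loop`, `oracle`, `batch_size`, and so on) are hypothetical.

```python
import math
import random


def predict_proba(w, b, x):
    """Probability of class 1 from a linear head on fixed features."""
    z = sum(wj * xj for wj, xj in zip(w, x)) + b
    z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
    return 1.0 / (1.0 + math.exp(-z))


def fit_head(X, y, epochs=200, lr=0.5):
    """Train a logistic-regression head with batch gradient descent.

    Only this small head is retrained each round; the representations
    themselves are never updated inside the loop.
    """
    w, b, n = [0.0] * len(X[0]), 0.0, len(X)
    for _ in range(epochs):
        gw, gb = [0.0] * len(w), 0.0
        for xi, yi in zip(X, y):
            err = predict_proba(w, b, xi) - yi
            for j, xj in enumerate(xi):
                gw[j] += err * xj
            gb += err
        w = [wj - lr * g / n for wj, g in zip(w, gw)]
        b -= lr * gb / n
    return w, b


def entropy(p):
    """Binary predictive entropy, used here as the uncertainty score."""
    p = min(max(p, 1e-12), 1.0 - 1e-12)
    return -(p * math.log(p) + (1.0 - p) * math.log(1.0 - p))


def active_learning_loop(features, oracle, seed_size=10, batch_size=10,
                         rounds=5, seed=0):
    """Acquire labels using a cheap head on precomputed features."""
    rng = random.Random(seed)
    labeled = rng.sample(range(len(features)), seed_size)
    labels = {i: oracle(i) for i in labeled}
    for _ in range(rounds):
        w, b = fit_head([features[i] for i in labeled],
                        [labels[i] for i in labeled])
        pool = [i for i in range(len(features)) if i not in labels]
        # Assumed acquisition rule: query the most uncertain pool items.
        pool.sort(key=lambda i: entropy(predict_proba(w, b, features[i])),
                  reverse=True)
        for i in pool[:batch_size]:
            labels[i] = oracle(i)
            labeled.append(i)
    return labeled, labels


# Synthetic stand-in for embeddings computed once by a frozen model.
data_rng = random.Random(1)
true_labels = [data_rng.randint(0, 1) for _ in range(200)]
features = [[data_rng.gauss(1.0 if y else -1.0, 1.0) for _ in range(8)]
            for y in true_labels]

labeled, labels = active_learning_loop(features, lambda i: true_labels[i])
# A single fine-tuning run of the full model on `labeled` would follow
# here; it is omitted because it depends on the model and training stack.
```

The point of the sketch is the cost structure: the expensive model is used once to compute features before the loop and once for fine-tuning after it, while every acquisition round only retrains the lightweight head.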

Abstract

Fine-tuning large language models (LLMs) is now a common approach for text classification in a wide range of applications. When labeled do