重新思考文本数据增强的有效性: 一个实证分析

Jun, 2023

重新思考文本数据增强的有效性: 一个实证分析

Rethink the Effectiveness of Text Data Augmentation: An Empirical Analysis

Zhengxiang Shi, Aldo Lipani

TL;DR本文研究评估了三种不同的微调方法在七种不同的自然语言处理任务中的效果，结果表明数据增强可以有效提高微调后的模型性能，特别是在少样本学习任务中，持续的预训练可以将性能提高10%以上。

Abstract

In recent years, language models (LMs) have made remarkable progress in advancing the field of natural language processing (NLP). However, the impact of data augmentation (DA) techniques on the →