A Survey of Data Augmentation Methods in Natural Language Processing

Oct, 2021

Data Augmentation Approaches in Natural Language Processing: A Survey

Bohan Li, Yutai Hou, Wanxiang Che

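TL;DR: This paper surveys three categories of data augmentation (paraphrasing, noising, and sampling), along with their applications and challenges in NLP.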

Abstract

As an effective strategy, data augmentation (DA) alleviates data scarcity in scenarios where deep learning techniques may fail. It was first widely applied in computer vision and then introduced to natural language processing, where it has achieved improvements in many tasks. One of the main focuses of the DA