BriefGPT.xyz
May, 2023
通过主动生成成对的反事实,提高分类器的健壮性
Improving Classifier Robustness through Active Generation of Pairwise Counterfactuals
HTML
PDF
Ananth Balashankar, Xuezhi Wang, Yao Qin, Ben Packer, Nithum Thain...
TL;DR
本文提出一种利用对抗生成模型自动生成对抗样本并用成对分类器对其自动标注的框架,通过对仅10%人工标注的对抗样本数据进行生成,能有效提高情感分类和问题重述任务等自然语言分类器的18-20%稳健性和14-21%误差缩减。
Abstract
counterfactual data augmentation
(CDA) is a commonly used technique for improving
robustness
in
natural language classifiers
. However, one
→