BriefGPT.xyz
Dec, 2022
DISCO: 利用大型语言模型提取短语反事实
DISCO: Distilling Phrasal Counterfactuals with Large Language Models
HTML
PDF
Zeming Chen, Qiyue Gao, Kyle Richardson, Antoine Bosselut, Ashish Sabharwal
TL;DR
该论文提出了一种名为DISCO的新框架,可以使用大规模语言模型生成高质量的反事实数据,并借助特定于任务的老师模型过滤生成,以提高模型的稳健性和泛化性能。实验结果表明,使用这种方式进行学习,学生模型的鲁棒性和跨分布能力比基线提高了6%(绝对)和5%。
Abstract
Recent methods demonstrate that
data augmentation
using
counterfactual knowledge
can teach models the causal structure of a task, leading to robust and generalizable models. However, such counterfactual data ofte
→