BriefGPT.xyz
Jan, 2022
线性对抗概念抹除
Linear Adversarial Concept Erasure
HTML
PDF
Shauli Ravfogel, Michael Twiton, Yoav Goldberg, Ryan Cotterell
TL;DR
提出了一种通过线性极小极大博弈模型来定位和清空文本中的线性子空间,以防止线性预测器恢复与偏见相关的概念,该方法可以减轻内在和外在因素造成的偏见。
Abstract
Modern
neural models
trained on textual data rely on
pre-trained representations
that emerge without direct supervision. As these representations are increasingly being used in real-world applications, the inabil
→