OSCaR: 词向量中正交子空间矫正及偏差校正

Jun, 2020

OSCaR: 词向量中正交子空间矫正及偏差校正

OSCaR: Orthogonal Subspace Correction and Rectification of Biases in Word Embeddings

Sunipa Dev, Tao Li, Jeff M Phillips, Vivek Srikumar

TL;DR本文提出了一种名为OSCaR的新的降低偏见的方法，该方法专注于解开概念之间的偏见关联而非整体去除概念。实验结果表明，OSCaR方法保证了嵌入中的语义信息被保留且能够有效地缓解偏见，特别是在性别偏见的情况下表现出良好的平衡性。

Abstract

Language representations are known to carry stereotypical biases and, as a result, lead to biased predictions in downstream tasks. While existing methods are effective at mitigating biases by linear projection, such methods are too aggressive: they not only remove bias, but also erase