TL;DR通过子空间投影去除word embeddings文本中性别刻板印象,提出了一种新的关联度量RIPA,发现skipgram with negative sampling (SGNS)在训练语料库中并未增加文本准确性别聚类,但对性别刻板印象词汇却增强其性别关联。
Abstract
word embeddings are often criticized for capturing undesirable word associations such as gender stereotypes. However, methods for measuring and removing such biases remain poorly understood. We show that for any