BriefGPT.xyz
Oct, 2018
理解词嵌入中偏见的起源
Understanding the Origins of Bias in Word Embeddings
HTML
PDF
Marc-Etienne Brunet, Colleen Alkalay-Houlihan, Ashton Anderson, Richard Zemel
TL;DR
使用我们的方法,可以了解到word embedding偏差的起源,并找出删除哪些文档可以最大程度地降低偏差。我们在《纽约时报》和Wikipedia语料库上演示了我们的技术,并发现我们的影响函数近似非常精确。
Abstract
The power of
machine learning
systems not only promises great technical progress, but risks societal harm. As a recent example, researchers have shown that popular word embedding algorithms exhibit stereotypical biases, such as gender
→