BriefGPT.xyz
Jul, 2022
偏见的诞生:一项关于英语语言模型中性别偏见演变的研究
The Birth of Bias: A case study on the evolution of gender bias in an English language model
HTML
PDF
Oskar van der Wal, Jaap Jumelet, Katrin Schulz, Willem Zuidema
TL;DR
研究发现使用 LSTM 架构训练的语言模型在表示性别时存在动态变化,并且性别信息逐渐局部化。通过监控训练动态,可以检测到女性和男性在输入嵌入中的表示不对称。去除偏见的策略如何应用需要更多深入探讨。
Abstract
Detecting and mitigating harmful
biases
in modern
language models
are widely recognized as crucial, open problems. In this paper, we take a step back and investigate how
→