BriefGPT.xyz
May, 2024
Harmful Speech Detection by Language Models Exhibits Gender-Queer Dialect Bias
Rebecca Dorn, Lee Kezar, Fred Morstatter, Kristina Lerman
TL;DR
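This paper analyzes content moderation on social media platforms, examines its bias against gender-diverse speech patterns, and evaluates how well five off-the-shelf language models rate the harmfulness of such text.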
Abstract
Content moderation on social media platforms shapes the dynamics of online discourse, influencing whose voices are amplified and whose are suppressed. Recent studies have raised concerns about the