On Racial Bias in Hate Speech and Abusive Language Detection Datasets

May, 2019

Racial Bias in Hate Speech and Abusive Language Detection Datasets

Thomas Davidson, Debasmita Bhattacharya, Ingmar Weber

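TL;DR: This paper examines racial bias in classifier-based techniques for identifying abusive language, as applied to five Twitter datasets, and warns of the unequal negative impact that using these techniques may produce.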

Abstract

Technologies for abusive language detection are being developed and applied with little consideration of their potential biases. We examine racial bias in five different sets of →