毒性检测评估框架：通过反馈评估毒性检测中的基本事实

May, 2023

毒性检测评估框架：通过反馈评估毒性检测中的基本事实

Toxicity Inspector: A Framework to Evaluate Ground Truth in Toxicity Detection Through Feedback

Huriyyah Althunayan, Rahaf Bahlas, Manar Alharbi, Lena Alsuwailem, Abeer Aldayel...

TL;DR本文介绍了一种毒性语言检测框架，通过考虑人为因素通过迭代反馈循环来提高毒性基准数据集的可靠性，以平衡性能和毒性避免之间的权衡。

Abstract

toxic language is difficult to define, as it is not monolithic and has many variations in perceptions of toxicity. This challenge of detecting toxic language is increased by the highly contextual and subjectivity