BriefGPT.xyz
May, 2023
Toxicity Inspector: A Framework to Evaluate Ground Truth in Toxicity Detection Through Feedback
Huriyyah Althunayan, Rahaf Bahlas, Manar Alharbi, Lena Alsuwailem, Abeer Aldayel...
TL;DR
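This paper introduces a toxic language detection framework that takes human factors into account, using an iterative feedback loop to improve the reliability of toxicity benchmark datasets and to balance the trade-off between performance and toxicity avoidance.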
Abstract
Toxic language is difficult to define, as it is not monolithic and perceptions of toxicity vary widely. This challenge of detecting toxic language is increased by the highly contextual and subjectivity...