BriefGPT.xyz
Aug, 2022
K-MHaS: A Multi-label Hate Speech Detection Dataset in Korean Online News Comment
Jean Lee, Taejun Lim, Heejun Lee, Bogeun Jo, Yangsok Kim...
TL;DR
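Introduces K-MHaS, a multi-label dataset suited to Korean language patterns, and evaluates Korean BERT-based models on it using six different metrics; KR-BERT with a sub-character tokenizer outperforms the other models.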
Abstract
Online hate speech detection has become important with the growth of digital devices, but resources in languages other than English are extremely limited. We introduce K-MHaS, a new multi-label dataset for hate s