BriefGPT.xyz
Mar, 2021
NLP 训练中的辛普森偏差
Simpson's Bias in NLP Training
HTML
PDF
Fei Yuan, Longtu Zhang, Huang Bojun, Yaobo Liang
TL;DR
研究机器学习中,针对不同数据集测量方法与训练模型的不一致性,引起Simpson's bias现象。
Abstract
In most
machine learning
tasks, we evaluate a model $M$ on a given data population $S$ by measuring a
population-level metric
$F(S;M)$. Examples of such evaluation metric $F$ include precision/recall for (binary)
→