BriefGPT.xyz
Jul, 2024
评估模型偏差需要表征其错误
Evaluating Model Bias Requires Characterizing its Mistakes
HTML
PDF
Isabela Albuquerque, Jessica Schrouff, David Warde-Farley, Taylan Cemgil, Sven Gowal...
TL;DR
适当基准模型性能是重要的,以便构建更好的预测器并增加对模型正常运行的信心。我们引入了SkewSize,它是一种捕捉模型预测中偏见的度量,能够在多类设置或开放词汇生成模型的情况下使用。SkewSize能够突显其他指标未捕捉到的偏见,并提供对近期技术的影响的见解。
Abstract
The ability to properly
benchmark
model performance
in the face of
spurious correlations
is important to both build better predictors and
→