BriefGPT.xyz
Jun, 2020
稻草堆中的针:极度类别不平衡下的标注效率评估
A general framework for label-efficient online evaluation with asymptotic guarantees
HTML
PDF
Neil G. Marchant, Benjamin I. P. Rubinstein
TL;DR
本文提出了一种基于自适应重要性抽样的在线评估框架,该框架可通过自适应分布来标记物品,以最大化统计精度,并通过实验验证,利用Dirichlet-tree模型实现了比固定标签预算的最新技术平均MSE更高的结果。
Abstract
Achieving statistically significant evaluation with passive sampling of test data is challenging in settings such as
extreme classification
and
record linkage
, where significant class imbalance is prevalent. Adap
→