Is margin all you need? An extensive empirical study of active learning on tabular data

Oct, 2022

Is margin all you need? An extensive empirical study of active learning on tabular data

Dara Bahri, Heinrich Jiang, Tal Schuster, Afshin Rostamizadeh

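TL;DR: This study comprehensively analyzes how a variety of active learning algorithms perform with deep neural networks on 69 real-world tabular classification datasets. It finds that the classical margin sampling technique matches or outperforms all other algorithms, including the current state of the art, across a wide range of experimental settings.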
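
For readers unfamiliar with margin sampling, the following is a minimal sketch of the standard formulation, not the authors' implementation. It assumes the model's predicted class probabilities for the unlabeled pool are available as an array. The function name `margin_sampling` and the toy probabilities are hypothetical and chosen only for illustration. The margin of a point is the gap between its two highest predicted class probabilities, and the points with the smallest margins are selected for labeling.

```python
import numpy as np


def margin_sampling(probs, batch_size):
    """Return indices of the `batch_size` pool points with the smallest margin.

    probs: array of shape (n_points, n_classes) holding predicted class
           probabilities for each unlabeled point (assumed input format).
    """
    probs = np.asarray(probs, dtype=float)
    # Sort each row ascending, so the last two columns hold the top-2 probabilities.
    sorted_probs = np.sort(probs, axis=1)
    margins = sorted_probs[:, -1] - sorted_probs[:, -2]
    # A small margin means the model is torn between two classes.
    return np.argsort(margins, kind="stable")[:batch_size]


# Toy pool of four unlabeled points over three classes (illustrative values).
probs = np.array([
    [0.90, 0.05, 0.05],
    [0.40, 0.35, 0.25],
    [0.50, 0.49, 0.01],
    [0.70, 0.20, 0.10],
])
selected = margin_sampling(probs, batch_size=2)
print(selected.tolist())
```

In this toy pool the third and second points have the narrowest gaps between their top two classes, so they would be sent for labeling first.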

Abstract

Given a labeled training set and a collection of unlabeled data, the goal of active learning (AL) is to identify the best unlabeled points to label. In this comprehensive study, we analyze the performance of a variety of →