TL;DR提出使用互信息测量方法的主动学习模型,使用 Bayesian linear basis functions 模型,在训练聚合数据的回归模型时减少标注集的成本,并实现更好的预测性能。
Abstract
Due to the privacy protection or the difficulty of data collection, we cannot
observe individual outputs for each instance, but we can observe aggregated
outputs that are summed over multiple instances in a set in some real-world
applications. To reduce the labeling cost for training regressi