Ernie Chang, Muhammad Hassan Rashid, Pin-Jie Lin, Changsheng Zhao, Vera Demberg...
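TL;DR: Predict the maximum achievable model performance from a small number of training samples, in order to estimate data quality and sample size.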
Abstract
Knowing exactly how many data points need to be labeled to achieve a certain
model performance is a hugely beneficial step towards reducing the overall
budgets for annotation. It pertains to both active learning