BriefGPT.xyz
Jun, 2011
基于委员会的概率分类器样本选择
Committee-Based Sample Selection for Probabilistic Classifiers
HTML
PDF
S. Argamon-Engelson, I. Dagan
TL;DR
本文研究了如何通过样本选择来降低训练中的注释成本,介绍了一种基于概率分类模型的委员会选样本的经验方法,这些变体是从与迄今为止标记的训练集条件化的概率分布中随机抽取的。其在自然语言处理领域,如随机词性标注任务上得到了显著的注释成本降低和模型尺寸缩小的效果。
Abstract
In many real-world learning tasks, it is expensive to acquire a sufficient number of labeled examples for training. This paper investigates methods for reducing
annotation cost
by `
sample selection
'. In this appr
→