BriefGPT.xyz
May, 2016
数据编程: 快速创建大规模训练集
Data Programming: Creating Large Training Sets, Quickly
HTML
PDF
Alexander Ratner, Christopher De Sa, Sen Wu, Daniel Selsam, Christopher Ré
TL;DR
为解决有限数据训练集的问题,本研究提出一种名为Data Programming的范式,通过弱监督策略和领域启发式标注函数生成训练集,以生成模型表示训练集的标注过程并降噪,探讨数据编程在监督学习中的应用及在TAC-KBP数据集上的检测等实验与研究。
Abstract
Large labeled training sets are the critical building blocks of
supervised learning
methods and are key enablers of
deep learning
techniques. For some applications, creating labeled training sets is the most time
→