BriefGPT.xyz
Sep, 2018
交叉视图训练的半监督序列建模
Semi-Supervised Sequence Modeling with Cross-View Training
HTML
PDF
Kevin Clark, Minh-Thang Luong, Christopher D. Manning, Quoc V. Le
TL;DR
本文提出一种半监督学习算法Cross-View Training, 结合无标签文本与有标签数据, 通过学习辅助预测模型来提升双向长短时记忆网络(Bi-LSTM)的表示学习能力,取得了在五项序列标记任务,机器翻译和依存句法分析等领域的最优结果。
Abstract
unsupervised representation learning
algorithms such as word2vec and ELMo improve the accuracy of many supervised
nlp
models, mainly because they can take advantage of large amounts of unlabeled text. However, th
→