BriefGPT.xyz
May, 2017
使用通用和特定词嵌入来分类研究的翻译阶段
Utility of general and specific word embeddings for classifying translational stages of research
HTML
PDF
Vincent Major, Alisa Surkis, Yindalon Aphinyanaphongs
TL;DR
本文探讨使用无监督学习的方法,通过单词嵌入在词向量空间内学习语义相似性,以实现对文本分类任务的性能优化。研究发现,使用领域特定的词嵌入可以提高分类性能。
Abstract
Conventional
text classification
models make a bag-of-words assumption reducing text, fundamentally a sequence of words, into word occurrence counts per document. Recent algorithms such as word2vec and fastText are capable of learning
→