组件增强的汉字嵌入

Aug, 2015

Component-Enhanced Chinese Character Embeddings

Yanran Li, Wenjie Li, Fei Sun, Sujian Li

TL;DR本文创新性地发展了两种增强中文字符嵌入模型及其二元模型扩展，它们通过探索中文字符的组合，来有效地捕捉语义信息并已成功地应用于词语相似度和文本分类任务。

Abstract

distributed word representations are very useful for capturing semantic information and have been successfully applied in a variety of NLP tasks, especially on English. In this work, we innovatively develop two component-enhanced →