BriefGPT.xyz
Aug, 2017
汉字的字形感知嵌入
Glyph-aware Embedding of Chinese Characters
HTML
PDF
Falcon Z. Dai, Zheng Cai
TL;DR
本文提出了一种新的基于汉字视觉外观的表示方法,采用卷积神经网络来将汉字的空间-结构模式以原始像素的方式统一表示,从而在两个基本的中文NLP任务:语言建模和分词中有效地表征了每个字符的语义和句法信息。
Abstract
Given the advantage and recent success of English character-level and subword-unit models in several NLP tasks, we consider the equivalent modeling problem for
chinese
.
chinese
script is logographic and many
→