中文神经分词学习

Jun, 2016

Neural Word Segmentation Learning for Chinese

Deng Cai, Hai Zhao

TL;DR本文提出了一种新颖的神经网络框架，利用门控组合神经网络和LSTM语言评分模型，消除上下文窗口，可以利用完整的分词历史，产生分布式表示，从而实现中文分词，并在基准数据集上进行实验，结果不需要使用现有方法的特征工程，获得了与现有最先进方法相当甚至更好的性能。

Abstract

Most previous approaches to chinese word segmentation formalize this problem as a character-based sequence labeling task so that only contextual information within fixed sized local windows and simple interactions between adjacent tags can be captured. In this paper, we propose a novel