Approaching Neural Chinese Word Segmentation as a Low-Resource Machine Translation Task

Aug, 2020

Approaching Neural Chinese Word Segmentation as a Low-Resource Machine Translation Task

Pinzhen Chen, Kenneth Heafield

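TL;DR: This work applies best practices from low-resource neural machine translation to supervised Chinese word segmentation, keeping model design inexpensive while achieving state-of-the-art results on par with other methods.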

Abstract

Supervised Chinese word segmentation has been widely approached as sequence labeling or sequence modeling. Recently, some researchers have attempted to treat it as character-level translation, but a performance gap remains between the translation-based approach and other methods. In