Latest Advances in Bidirectional LSTMs for Chinese Word Segmentation

Aug, 2018

State-of-the-art Chinese Word Segmentation with Bi-LSTMs

Ji Ma, Kuzman Ganchev, David Weiss

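TL;DR: On the Chinese word segmentation task, a bidirectional LSTM model combined with standard deep learning techniques and best practices achieves better accuracy on many popular datasets than more complex neural network models. In addition, error analysis shows that out-of-vocabulary words remain challenging for neural network models, and that the remaining errors are unlikely to be fixed by architecture changes; instead, more effort should go into exploring resources to further improve accuracy.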

Abstract

A wide variety of neural-network architectures have been proposed for the task of Chinese word segmentation. Surprisingly, we find that a bidirectional →