BriefGPT.xyz
Mar, 2019
利用单词嵌入改善跨领域中文分词
Improving Cross-Domain Chinese Word Segmentation with Word Embeddings
HTML
PDF
Yuxiao Ye, Weikang Li, Yue Zhang, Likun Qiu, Jian Sun
TL;DR
本文提出了一种基于半监督学习的词嵌入方法,用于提高跨领域中文分词的性能,实验证明该方法在小样本领域中表现良好,可以优化分词结果,尤其是在分割具有特定领域名词实体的数据集时较为有效。
Abstract
cross-domain
chinese word segmentation
(CWS) remains a challenge despite recent progress in neural-based CWS. The limited amount of annotated data in the target domain has been the key obstacle to a satisfactory
→