BriefGPT.xyz
Nov, 2022
解决中文字符表示瓶颈问题:基于笔画序列建模的神经机器翻译
Breaking the Representation Bottleneck of Chinese Characters: Neural Machine Translation with Stroke Sequence Modeling
HTML
PDF
Zhijun Wang, Xuebo Liu, Min Zhang
TL;DR
本篇论文提出了一种名为StrokeNet的新型汉字表示方法,它通过拉丁化的笔划序列为汉字表示,解决了学习瓶颈和参数瓶颈问题,可应用于神经机器翻译中,有效提高翻译性能并减少模型参数。
Abstract
Existing research generally treats
chinese character
as a minimum unit for representation. However, such
chinese character
representation will suffer two bottlenecks: 1) Learning bottleneck, the learning cannot b
→