BriefGPT.xyz
Nov, 2020
纯字符级神经机器翻译的理解:以从芬兰语到英语的翻译为例
Understanding Pure Character-Based Neural Machine Translation: The Case of Translating Finnish into English
HTML
PDF
Gongbo Tang, Rico Sennrich, Joakim Nivre
TL;DR
本文探讨了纯字符级模型在芬兰语到英语机器翻译中的效果,并证明了字符序列中不同位置的字符在学习语言知识方面扮演着不同的角色。通过实验证明,单头的基于字级别的注意力机制会导致 BLEU 分数下降 1.2 分。
Abstract
Recent work has shown that deeper character-based
neural machine translation
(NMT) models can outperform subword-based models. However, it is still unclear what makes deeper
character-based models
successful. In
→