BriefGPT.xyz
Sep, 2021
Language Modeling, Lexical Translation, Reordering: The Training Process of NMT through the Lens of Classical SMT
Elena Voita, Rico Sennrich, Ivan Titov
TL;DR
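By studying how an NMT model's competences develop over the course of training, the authors find that the way it learns target-side language modeling, word-by-word translation, and complex reordering patterns differs markedly from traditional SMT models, and they discuss how this understanding can be applied in practice to improve NMT models.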
Abstract
Differently from the traditional statistical MT that decomposes the translation task into distinct separately learned components, neural machine translation uses a single neural network to model the entire translation process. Despite