BriefGPT.xyz
Apr, 2017
神经机器翻译集成的展开和收缩
Unfolding and Shrinking Neural Machine Translation Ensembles
HTML
PDF
Felix Stahlberg, Bill Byrne
TL;DR
该研究通过将集合模型解开成单个大型神经网络,并采用降维技术,旨在提高神经机器翻译的性能和运行速度。在英日翻译任务中,该网络的大小和解码速度与单个 NMT 网络相当,而性能却相当于一个 3-ensemble 系统。
Abstract
ensembling
is a well-known technique in
neural machine translation
(NMT). Instead of a single neural net, multiple neural nets with the same topology are trained separately, and the decoder generates predictions
→