BriefGPT.xyz
May, 2020
Selecting Backtranslated Data from Multiple Sources for Improved Neural Machine Translation
Xabier Soto, Dimitar Shterionov, Alberto Poncelas, Andy Way
TL;DR
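This paper proposes generating synthetic training corpora by backtranslating with different machine translation approaches, then applying data selection strategies to get the most out of that data, with the aim of improving machine translation quality for low-resource languages. The results indicate that this approach can effectively improve machine translation performance.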
Abstract
Machine translation (MT) has benefited from using synthetic training data originating from translating monolingual corpora, a technique known as backtranslation. Combining backtranslated data from different sourc