BriefGPT.xyz
Jan, 2022
低资源神经机器翻译的高性价比训练
Cost-Effective Training in Low-Resource Neural Machine Translation
HTML
PDF
Sai Koneru, Danni Liu, Jan Niehues
TL;DR
提出了一种利用自监督学习和小规模词典来初始化神经机器翻译(NMT)模型,在初始化后使用主动学习策略提高低资源条件下(如稀缺语言)翻译模型性能的方法,并提出了一种基于领域适应的新型主动学习策略。除此之外,我们还表明,使用这种初始化方法和主动学习策略可相比于传统方法提高最多13个BLEU点。
Abstract
While
active learning
(AL) techniques are explored in
neural machine translation
(NMT), only a few works focus on tackling low annotation budgets where a limited number of sentences can get translated. Such situa
→