BriefGPT.xyz
Mar, 2018
eSCAPE:用于自动后期编辑的大规模合成语料库
eSCAPE: a Large-scale Synthetic Corpus for Automatic Post-Editing
HTML
PDF
Matteo Negri, Marco Turchi, Rajen Chatterjee, Nicola Bertoldi
TL;DR
该论文介绍了eSCAPE,这是目前最大的免费合成语料库,为机器翻译的自动后编辑训练模型提供了大量数据,并使用模型在通用领域方案中实验证明了其有效性。
Abstract
training models
for the
automatic correction
of machine-translated text usually relies on data consisting of (source, MT, human post- edit) triplets providing, for each source sentence, examples of translation er
→