BriefGPT.xyz
Feb, 2022
神经机器翻译中的数据缩放定律: 噪声和架构的影响
Data Scaling Laws in NMT: The Effect of Noise and Architecture
HTML
PDF
Yamini Bansal, Behrooz Ghorbani, Ankush Garg, Biao Zhang, Maxim Krikun...
TL;DR
本文研究了神经机器翻译中体系结构和训练数据质量的变化对数据缩放性质的影响,并发现使用返向翻译数据会显著降低缩放系数。
Abstract
In this work, we study the effect of varying the architecture and
training data quality
on the
data scaling properties
of
neural machine translat
→