BriefGPT.xyz
Jul, 2021
神经机器翻译置信度感知的计划采样
Confidence-Aware Scheduled Sampling for Neural Machine Translation
HTML
PDF
Yijin Liu, Fandong Meng, Yufeng Chen, Jinan Xu, Jie Zhou
TL;DR
本文提出了基于置信度的定时采样策略,针对神经机器翻译中定时采样策略无法根据实时模型表现进行调整的问题,通过使用模型预测的置信度来量化实时模型表现,并设计了细粒度的采样策略。实验结果表明,该策略在Transformer模型上显著优于 Vanilla定时采样,可以提高翻译质量和收敛速度。
Abstract
scheduled sampling
is an effective method to alleviate the exposure bias problem of
neural machine translation
. It simulates the inference scene by randomly replacing ground-truth target input tokens with predict
→