BriefGPT.xyz
May, 2023
非参数最近邻辅助微调神经机器翻译
Non-parametric, Nearest-neighbor-assisted Fine-tuning for Neural Machine Translation
HTML
PDF
Jiayi Wang, Ke Wang, Yuqi Zhang, Yu Zhao, Pontus Stenetorp
TL;DR
研究探究了在微调阶段引入kNN预测的统计数据来提高基线翻译模型,发现通过引入gating机制,kNN的真实概率和强化学习三种方法,相比于传统的微调,可以在四个标准机器翻译数据集上实现一致的改进,尤其于翻译语法关系或功能词时表现出更大的提升。
Abstract
non-parametric
,
k-nearest-neighbor
algorithms have recently made inroads to assist generative models such as language models and
machine translat
→