BriefGPT.xyz
Jul, 2022
Multimodal Neural Machine Translation with Search Engine Based Image Retrieval
ZhenHao Tang, XiaoBing Zhang, Zi Long, XiangHua Fu
TL;DR
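This paper proposes using an image search engine and a text-aware attentive visual encoder to collect and filter descriptive images in order to improve the performance of neural machine translation. Experiments on multiple datasets show that the method achieves significant gains over strong baselines.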
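To make the filtering idea in the summary above concrete, here is a minimal sketch of text-conditioned attention over retrieved image features. It is not the authors' encoder: the function name `text_aware_pool`, the use of scaled dot-product scoring, and the toy two-dimensional vectors are all assumptions chosen for illustration. The point is only that images whose features align with the sentence representation receive higher weights, so less relevant search results contribute less to the pooled visual vector.

```python
import math
from typing import List, Tuple

Vector = List[float]


def dot(a: Vector, b: Vector) -> float:
    """Plain dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def softmax(scores: List[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def text_aware_pool(text_vec: Vector, image_vecs: List[Vector]) -> Tuple[List[float], Vector]:
    """Hypothetical helper: weight each retrieved image by its scaled
    dot-product similarity to the text vector, then return the weights
    and the weighted average of the image vectors."""
    if not image_vecs:
        raise ValueError("at least one image vector is required")
    scale = math.sqrt(len(text_vec))
    weights = softmax([dot(text_vec, v) / scale for v in image_vecs])
    pooled = [
        sum(w * v[i] for w, v in zip(weights, image_vecs))
        for i in range(len(text_vec))
    ]
    return weights, pooled


# Toy example: the first image is aligned with the text, the second is not.
text = [1.0, 0.0]
images = [[2.0, 0.0], [0.0, 2.0]]
weights, pooled = text_aware_pool(text, images)
```

In this toy case the first image gets the larger weight because its feature vector points in the same direction as the text vector. A real system would use learned projections and encoder outputs rather than raw vectors.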
Abstract
Recently, a number of works have shown that the performance of neural machine translation (NMT) can be improved to a certain extent by using visual information. However, most of these conclusions are drawn from the a