L2C: 描述视觉差异需要对个体进行语义理解

Feb, 2021

L2C: 描述视觉差异需要对个体进行语义理解

L2C: Describing Visual Differences Needs Semantic Understanding of Individuals

An Yan, Xin Eric Wang, Tsu-Jui Fu, William Yang Wang

TL;DR本文介绍了一种Learning-to-Compare模型，该模型能够理解两个图像之间的语义结构并学习描述每个图像，从而有效地进行图像比较和生成描述。使用该模型可以在Birds-to-Words数据集上实现比基准模型更好的性能，且同时在自动评估和人类评估中表现良好。

Abstract

Recent advances in language and vision push forward the research of captioning a single image to describing visual differences between image pairs. Suppose there are two images, I_1 and I_2, and the task is to ge