TL;DR本研究提出了一种基于条件 GAN 的新方法,通过合成俯视图像,将两个视图之间的差距最小化,实现了对视觉实体的跨视图建模并进行特征融合,最终在 CVUSA 数据集上成功实现了景点检索任务。
Abstract
The visual entities in cross-view images exhibit drastic domain changes due
to the difference in viewpoints each set of images is captured from. Existing
state-of-the-art methods address the problem by learning view-invariant
descriptors for the images. We propose a novel method for so