BriefGPT.xyz
Jun, 2021
基于逐层分步对齐的图像-文本匹配网络
Step-Wise Hierarchical Alignment Network for Image-Text Matching
HTML
PDF
Zhong Ji, Kexin Chen, Haoran Wang
TL;DR
本文提出了一种逐步分层对齐网络 (SHAN) 的图像 - 文本匹配方法,将图像 - 文本匹配分解成多步跨模态推理过程以捕捉层次化的细粒度相关性,并在两个基准数据集上进行了实验。
Abstract
image-text matching
plays a central role in bridging the semantic gap between vision and language. The key point to achieve precise
visual-semantic alignment
lies in capturing the fine-grained cross-modal corresp
→