BriefGPT.xyz
Mar, 2024
图像-文本检索的跨模态和单模态软标签对齐
Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval
HTML
PDF
Hailang Huang, Zhijie Nie, Ziqiao Wang, Ziyu Shang
TL;DR
通过引入交叉模态和单模态软标签对齐(CUSA)方法,我们解决了图像-文本检索中的两个问题:模态间匹配缺失和模态内语义损失。实验证明,我们的方法可以提升图像-文本检索以及单模态检索的性能,达到新的最先进水平。
Abstract
Current
image-text retrieval
methods have demonstrated impressive performance in recent years. However, they still face two problems: the
inter-modal matching
missing problem and the
→