BriefGPT.xyz
Jul, 2022
Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval
Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao
TL;DR
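This paper investigates the open research problem of generating text-image pairs to improve training for fine-grained image-to-text cross-modal retrieval, and proposes a novel paired data augmentation framework that uncovers the hidden semantic information of the StyleGAN2 model.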
Abstract
This paper investigates an open research problem of generating text-image pairs to improve the training of the fine-grained image-to-text cross-modal retrieval task, and proposes a novel framework for paired …