BriefGPT.xyz
Mar, 2024
通过困难负样本增强多模态对比学习中的概念理解
Enhancing Conceptual Understanding in Multimodal Contrastive Learning through Hard Negative Samples
HTML
PDF
Philipp J. Rösch, Norbert Oswald, Michaela Geierhos, Jindřich Libovický
TL;DR
通过合成困难的负面文字示例,引入了一种新的预训练方法来改善视觉-语言模型中细粒度概念理解的问题,并介绍了一个新的具有挑战性的用于评估颜色、物体和大小细粒度对齐的数据集 InpaintCOCO。
Abstract
Current
multimodal models
leveraging
contrastive learning
often face limitations in developing
fine-grained conceptual understanding
. This
→