BriefGPT.xyz
Apr, 2024
利用跨模态邻居表示改进 CLIP 分类
Leveraging Cross-Modal Neighbor Representation for Improved CLIP Classification
HTML
PDF
Chao Yi, Lu Ren, De-Chuan Zhan, Han-Jia Ye
TL;DR
通过自动生成高质量多样文本,利用CrOss-moDal nEighbor Representation (CODER) 对CLIP进行特征提取,提高CLIP在单模态特征提取上的性能,进而充分发挥其强大的跨模态匹配能力。
Abstract
clip
showcases exceptional
cross-modal matching
capabilities due to its training on image-text contrastive learning tasks. However, without specific optimization for unimodal scenarios, its performance in single-
→