BriefGPT.xyz
Nov, 2023
CLAP:对预先训练的视觉语言模型鲁棒性的增强提示对比学习
CLAP: Contrastive Learning with Augmented Prompts for Robustness on Pretrained Vision-Language Models
HTML
PDF
Yichao Cai, Yuhang Liu, Zhen Zhang, Javen Qinfeng Shi
TL;DR
通过文本增强方法,不需要在对抗性示例上重新训练图像编码器,从而增强视觉-语言模型的稳健性,并且实验证明了在各种数据集上对预训练的CLIP模型的稳健性有显著改善。
Abstract
contrastive vision-language models
, e.g., CLIP, have garnered substantial attention for their exceptional generalization capabilities. However, their
robustness
to
→