BriefGPT.xyz
Nov, 2023
视觉-语言模型的对抗提示调整
Adversarial Prompt Tuning for Vision-Language Models
HTML
PDF
Jiaming Zhang, Xingjun Ma, Xin Wang, Lingyu Qiu, Jiaqi Wang...
TL;DR
通过引入Adversarial Prompt Tuning (AdvPT)技术,本研究旨在提升视觉-语言模型中图像编码器的对抗性鲁棒性,改善对抗攻击的脆弱性,并且结合现有的基于图像处理的防御技术,进一步提高其防御能力。
Abstract
With the rapid advancement of
multimodal learning
, pre-trained
vision-language models
(VLMs) such as CLIP have demonstrated remarkable capacities in bridging the gap between visual and language modalities. Howeve
→