BriefGPT.xyz
Apr, 2025
QAVA:针对大型视觉语言模型的查询无关视觉攻击
QAVA: Query-Agnostic Visual Attack to Large Vision-Language Models
HTML
PDF
Yudong Zhang, Ruobing Xie, Jiansheng Chen, Xingwu Sun, Zhanhui Kang...
TL;DR
本研究针对大规模视觉语言模型(LVLMs)在视觉问答(VQA)任务中对特定图像和问题的脆弱性,提出了一种查询无关视觉攻击(QAVA)。这一新方法能够生成对未知问题产生错误响应的稳健对抗样本,从而显著提高了在未知问题情况下攻击的有效性和效率,揭示了LVLMs在视觉对抗威胁中的新兴漏洞。
Abstract
In typical multimodal tasks, such as
Visual Question Answering
(VQA), adversarial attacks targeting a specific image and question can lead large
Vision-Language Models
(LVLMs) to provide incorrect answers. Howeve
→