BriefGPT.xyz
Jul, 2024
图像到文本的逻辑越狱:你的想象力可以帮助你任何事情
Image-to-Text Logic Jailbreak: Your Imagination can Help You Do Anything
HTML
PDF
Xiaotian Zou, Yongkang Chen
TL;DR
使用大型视觉语言模型(VLMs)生成有深度和细致度的回复是非常成功的,然而,结合视觉输入后,新的安全隐患出现了,恶意攻击者可以利用多种模态来达到他们的目的。本文聚焦于通过有意义的图像来产生针对性文本,揭示了当前VLMs在图像转文本方面存在的严重漏洞,强调在实际部署之前需要对VLMs的安全漏洞进行深入研究。
Abstract
large visual language models
(VLMs) such as GPT-4 have achieved remarkable success in generating comprehensive and nuanced responses, surpassing the capabilities of large language models. However, with the integration of visual inputs, new
→