BriefGPT.xyz
Jun, 2024
引导视觉转换器的视觉提示学习
Learning Visual Prompts for Guiding the Attention of Vision Transformers
HTML
PDF
Razieh Rezaei, Masoud Jalili Sabet, Jindong Gu, Daniel Rueckert, Philip Torr...
TL;DR
通过在输入图像中引入视觉提示信息,本研究旨在为视觉变换器模型设计学习视觉提示,以引导其注意力集中在图像的特定区域,通过自监督学习的方式进行优化,实验结果表明该优化策略在各种预训练视觉编码器中的效果显著。
Abstract
visual prompting
infuses visual information into the input image to adapt
models
toward specific predictions and tasks. Recently, manually crafted
→