BriefGPT.xyz
Feb, 2024
LSPT: 长期空间提示调整用于视觉表示学习
LSPT: Long-term Spatial Prompt Tuning for Visual Representation Learning
HTML
PDF
Shentong Mo, Yansen Wang, Xufang Luo, Dongsheng Li
TL;DR
长期空间提示调整 (LSPT) 是一种革命性的视觉表示学习方法,通过引入长期的门控提示,巧妙地结合了时间编码和空间编码,提高了视觉类别的区分和识别能力,同时在5个FGVC和19个VTAB-1K基准测试中展示了优于其他方法的性能。
Abstract
visual prompt tuning
(VPT) techniques have gained prominence for their capacity to adapt pre-trained Vision Transformers (
vits
) to downstream visual tasks using specialized learnable tokens termed as prompts. Con
→