BriefGPT.xyz
Nov, 2021
Tip-Adapter:面向视觉语言模型的无需训练的CLIP适配器
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling
HTML
PDF
Renrui Zhang, Rongyao Fang, Peng Gao, Wei Zhang, Kunchang Li...
TL;DR
该论文提出了一种名为 Tip-Adapter 的基于 CLIP 的适配器模型,通过无需训练的键值缓存模型构建配适器权重,极大地提升了 CLIP 的少样本分类能力。
Abstract
Contrastive Vision-Language Pre-training, known as
clip
, has provided a new paradigm for learning
visual representations
by using large-scale
con
→