BriefGPT.xyz
Mar, 2024
冻结视觉语言模型的测试时视觉识别中的上下文提示学习
In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model
HTML
PDF
Junhui Yin, Xinyu Zhang, Lin Wu, Xianghua Xie, Xiaojie Wang
TL;DR
通过测试样本的无监督目标,在视觉识别任务中使用上下文提示学习来适应预训练的视觉-语言模型,并取得了在各种下游数据集上的有效结果。
Abstract
Existing
pre-trained vision-language models
, e.g., CLIP, have demonstrated impressive
zero-shot generalization capabilities
in various downstream tasks. However, the performance of these models will degrade signi
→