BriefGPT.xyz
Sep, 2023
PRE: Vision-Language Prompt Learning with Reparameterization Encoder
Anh Pham Thi Minh
TL;DR
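PRE is a simple and efficient method that uses a prompt encoder to reparameterize the input prompt embeddings, strengthening its ability to extract task-specific knowledge from few-shot samples. It achieves a 5.60% improvement in average accuracy on new classes and a 3% improvement in harmonic mean.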
Abstract
Large pre-trained vision-language models such as CLIP have demonstrated great potential in zero-shot transferability to downstream tasks. However, to attain optimal performance, the manual selection of prompts is …