BriefGPT.xyz
Aug, 2024
通过测试时提示调优适应开放类的视觉-语言模型
Adapting Vision-Language Models to Open Classes via Test-Time Prompt Tuning
HTML
PDF
Zhengqing Gao, Xiang Ao, Xu-Yao Zhang, Cheng-Lin Liu
TL;DR
本研究解决了将预训练模型适应开放类场景的挑战,尤其是在新类出现时提示的通用性不足问题。通过提出一种测试时提示调优的方法,利用最大概念匹配评分生成输入条件的提示,从而增强模型性能。实验表明,该方法在多个数据集上优于现有所有对比方法,具有显著提升效果。
Abstract
Adapting pre-trained models to
Open Classes
is a challenging problem in machine learning.
Vision-Language Models
fully explore the knowledge of text modality, demonstrating strong
→