BriefGPT.xyz
Aug, 2023
朝着具有CLIP的逼真无监督微调
Towards Realistic Unsupervised Fine-tuning with CLIP
HTML
PDF
Jian Liang, Lijun Sheng, Zhengbo Wang, Ran He, Tieniu Tan
TL;DR
通过将视觉语言模型(VLMs)应用于下游监督学习任务,本文探讨了无监督微调CLIP模型,解决了未知类别的样本和识别预定义类别实例的问题,并提出了一种称为通用熵优化(UEO)的简单有效的微调方法。通过广泛的实验,我们证明了UEO方法在泛化能力和检测未知类别样本方面优于基线方法。
Abstract
The emergence of
vision-language models
(VLMs), such as CLIP, has spurred a significant research effort towards their application for
downstream supervised learning
tasks. Although some previous studies have expl
→