BriefGPT.xyz
Oct, 2023
CLIP 融合模型库专家:视觉增强的伪监督
CLIP meets Model Zoo Experts: Pseudo-Supervision for Visual Enhancement
HTML
PDF
Mohammadreza Salehi, Mehrdad Farajtabar, Maxwell Horton, Fartash Faghri, Hadi Pouransari...
TL;DR
通过在CLIP训练中结合任务特定的视觉模型,利用伪标签来改进其视觉表示,该简单的设置在不妨碍现有性能的前提下,显著提高了不同视觉任务的效果。
Abstract
contrastive language image pretraining
(CLIP) is a standard method for training
vision-language models
. While CLIP is scalable, promptable, and robust to distribution shifts on image classification tasks, it lack
→