BriefGPT.xyz
Mar, 2024
多模态表示的校准:无需标注的群体稳健性追求
Calibrating Multi-modal Representations: A Pursuit of Group Robustness without Annotations
HTML
PDF
Chenyu You, Yifei Min, Weicheng Dai, Jasjeet S. Sekhon, Lawrence Staib...
TL;DR
探索在不使用任何组标注的情况下减轻CLIP对虚假特征依赖的方法,通过基于对比学习的轻量级表示校准方法对预训练CLIP进行微调,从而显著减少依赖并大大提升模型的泛化能力。
Abstract
fine-tuning
pre-trained vision-language models
, like
clip
, has yielded success on diverse downstream tasks. However, several pain points p
→