BriefGPT.xyz
Mar, 2022
民主化对比语言-图像预训练:一个数据、模型和监督的 CLIP 基准
Democratizing Contrastive Language-Image Pre-training: A CLIP Benchmark of Data, Model, and Supervision
HTML
PDF
Yufeng Cui, Lichen Zhao, Feng Liang, Yangguang Li, Jing Shao
TL;DR
本文提出CLIP-benchmark,对CLIP及其变种进行评估、分析和基准测试,并发现了数据、监督和模型架构三个关键因素对性能的影响及应用更恰当的监督可以有效提高CLIP性能。
Abstract
Contrastive Language-Image Pretraining (
clip
) has emerged as a novel paradigm to learn
visual models
from
language supervision
. While rese
→