BriefGPT.xyz
Sep, 2023
消除CLIP数据的神秘
Demystifying CLIP Data
HTML
PDF
Hu Xu, Saining Xie, Xiaoqing Ellen Tan, Po-Yao Huang, Russell Howes...
TL;DR
以数据筛选为核心的对比语言-图像预训练及元数据筛选的方法MetaCLIP,在多个标准基准测试中优于CLIP以CommonCrawl为数据源的结果,MetaCLIP在零样本ImageNet分类中达到70.8%的准确率,并在1B数据的情况下保持相同的训练预算达到72.4%的准确率。
Abstract
contrastive language-image pre-training
(
clip
) is an approach that has advanced research and applications in computer vision, fueling modern recognition systems and generative models. We believe that the main ing
→