BriefGPT.xyz
Jun, 2023
UniBoost: 无监督单模态预训练来提升零样本视觉语言任务能力
UniBoost: Unsupervised Unimodal Pre-training for Boosting Zero-shot Vision-Language Tasks
HTML
PDF
Yanan Sun, Zihan Zhong, Qi Fan, Chi-Keung Tang, Yu-Wing Tai
TL;DR
使用大规模非监督单模型预训练可以提高图像-文本匹配的零样本性能和模型理解图像和文本关系的能力
Abstract
Large-scale joint training of
multimodal models
, e.g.,
clip
, have demonstrated great performance in many vision-language tasks. However, image-text pairs for
→