BriefGPT.xyz
Oct, 2022
A Unified View of Masked Image Modeling
Zhiliang Peng, Li Dong, Hangbo Bao, Qixiang Ye, Furu Wei
TL;DR
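This paper proposes a unified view of masked image modeling after revisiting existing methods, and introduces MaskDistill, a simple yet effective method that reconstructs normalized semantic features from a teacher model at the masked positions of corrupted input images. The goal is to address the label-hungry problem of training large-scale vision transformers. Experiments show that MaskDistill outperforms existing methods on image classification and semantic segmentation.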
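The objective described in the TL;DR above can be sketched roughly as follows: a student predicts features for each patch, and the loss compares those predictions with normalized teacher features at the masked positions only. This is a minimal illustration, not the authors' implementation. The function names, the use of layer normalization as the "normalization", and the smooth-L1 distance are all assumptions made for this sketch; the summary only states that normalized teacher features are reconstructed at masked positions.

```python
import math


def layer_norm(vec, eps=1e-6):
    """Normalize one feature vector to zero mean and unit variance.

    Assumption: layer normalization stands in for the unspecified
    normalization of the teacher features.
    """
    mean = sum(vec) / len(vec)
    var = sum((x - mean) ** 2 for x in vec) / len(vec)
    return [(x - mean) / math.sqrt(var + eps) for x in vec]


def smooth_l1(pred, target, beta=1.0):
    """Mean smooth-L1 distance between two vectors (assumed loss choice)."""
    total = 0.0
    for p, t in zip(pred, target):
        d = abs(p - t)
        total += 0.5 * d * d / beta if d < beta else d - 0.5 * beta
    return total / len(pred)


def mask_distill_loss(student_feats, teacher_feats, mask):
    """Average the per-patch loss over masked patches only.

    student_feats: per-patch predictions from the student, which sees
                   the corrupted (masked) image.
    teacher_feats: per-patch features from the teacher, which sees the
                   full image.
    mask:          per-patch booleans, True where the patch was masked.
    """
    losses = [
        smooth_l1(s, layer_norm(t))
        for s, t, m in zip(student_feats, teacher_feats, mask)
        if m
    ]
    return sum(losses) / len(losses) if losses else 0.0


if __name__ == "__main__":
    # Hypothetical toy data: two patches with three feature dimensions each.
    teacher = [[1.0, 2.0, 3.0], [4.0, 0.0, -4.0]]
    student = [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
    mask = [True, False]  # only the first patch is masked
    print(mask_distill_loss(student, teacher, mask))
```

Restricting the loss to masked positions means the student is scored only on patches it could not see, so it has to infer the teacher's semantics from the visible context.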
Abstract
Masked image modeling has demonstrated great potential to eliminate the label-hungry problem of training large-scale vision transformers, achieving impressive performance on various downstream tasks. In this work