BriefGPT.xyz
Apr, 2023
目标很重要:理解自监督目标对视觉Transformer表示的影响
Objectives Matter: Understanding the Impact of Self-Supervised Objectives on Vision Transformer Representations
HTML
PDF
Shashank Shekhar, Florian Bordes, Pascal Vincent, Ari Morcos
TL;DR
本研究分析了视觉变压器自监督学习的两种主要范式,在结构和可转移性方面的影响差异,揭示了联合嵌入特征在分类线性探针传输方面表现更好的原因。
Abstract
joint-embedding based learning
(e.g., SimCLR, MoCo, DINO) and
reconstruction-based learning
(e.g., BEiT, SimMIM, MAE) are the two leading paradigms for
→