BriefGPT.xyz
Aug, 2021
ConvNets与Transformers:哪个视觉表示更易转移?
ConvNets vs. Transformers: Whose Visual Representations are More Transferable?
HTML
PDF
Hong-Yu Zhou, Chixiang Lu, Sibei Yang, Yizhou Yu
TL;DR
通过15项单任务和多任务性能评估,系统地研究了ConvNets和vision transformers的迁移学习能力,发现vision transformers在13个下游任务中表现出一致优势,并且更适合于多任务学习。
Abstract
vision transformers
have attracted much attention from computer vision researchers as they are not restricted to the spatial inductive bias of
convnets
. However, although Transformer-based backbones have achieved
→