BriefGPT.xyz
Jan, 2022
2020年代用于ConvNet(卷积神经网络)的神经网络
A ConvNet for the 2020s
HTML
PDF
Zhuang Liu, Hanzi Mao, Chao-Yuan Wu, Christoph Feichtenhofer, Trevor Darrell...
TL;DR
本研究重新审视设计空间,逐步将标准ResNet现代化为Vision Transformer的设计,发现了几个关键组件,并发现纯ConvetNets模型家族ConvNeXt可以在精度和可伸缩性方面与Transformer竞争,在ImageNet的top-1准确率方面达到了87.8%,并在COCO检测和ADE20K分割上优于Swin Transformer 。
Abstract
The "Roaring 20s" of visual recognition began with the introduction of
vision transformers
(ViTs), which quickly superseded
convnets
as the state-of-the-art image classification model. A vanilla ViT, on the other
→