BriefGPT.xyz
Jan, 2023
为卷积网络设计BERT: 稀疏和分级掩码建模
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling
HTML
PDF
Keyu Tian, Yi Jiang, Qishuai Diao, Chen Lin, Liwei Wang...
TL;DR
本论文通过采用稀疏卷积和分层解码器等新技术,将BERT-风格的预训练方法推广到卷积神经网络领域,并且在ResNet和ConvNeXt等模型上进行了验证,在目标检测和实例分割等任务中,优于当前最先进的对比学习和变换器掩模建模方法。
Abstract
We identify and overcome two key obstacles in extending the success of BERT-style
pre-training
, or the masked image modeling, to
convolutional networks
(convnets): (i) convolution operation cannot handle irregula
→