BriefGPT.xyz
Apr, 2023
Vision Conformer:将卷积融入 Vision Transformer 层中
Vision Conformer: Incorporating Convolutions into Vision Transformer Layers
HTML
PDF
Brian Kenji Iwana, Akihiro Kusuda
TL;DR
本研究通过将卷积神经网络与神经网络模型Transformer相结合,提出了一种名为“Vision Conformer”的模型,并通过实验证明了此模型对ViT图像识别能力的提升。
Abstract
transformers
are popular neural network models that use layers of self-attention and fully-connected nodes with embedded tokens. Vision
transformers
(ViT) adapt
→