BriefGPT.xyz
Jul, 2021
CMT:卷积神经网络与视觉Transformer相遇
CMT: Convolutional Neural Networks Meet Vision Transformers
HTML
PDF
Jianyuan Guo, Kai Han, Han Wu, Chang Xu, Yehui Tang...
TL;DR
本文提出了一种基于Transformer和CNN的新型混合神经网络(CMTs),通过捕捉图像中的长程依赖和建模本地特征,实现了比现有的DeiT和EfficientNet更高的精度和更小的计算成本。
Abstract
vision transformers
have been successfully applied to
image recognition
tasks due to their ability to capture long-range dependencies within an image. However, there are still gaps in both performance and
→