BriefGPT.xyz
Jun, 2021
XCiT: 跨协方差图像变换器
XCiT: Cross-Covariance Image Transformers
HTML
PDF
Alaaeldin El-Nouby, Hugo Touvron, Mathilde Caron, Piotr Bojanowski, Matthijs Douze...
TL;DR
本文介绍了基于交叉协方差矩阵的交叉协方差注意力(XCA),用于高分辨率图像的高效处理。文章基于XCA构建了交叉协方差图像变换器(XCiT),并在多个视觉基准测试中取得了优异的结果,包括ImageNet-1k上的图像分类和自监督特征学习,COCO上的目标检测和实例分割以及ADE20K上的语义分割。
Abstract
Following their success in natural language processing,
transformers
have recently shown much promise for
computer vision
. The
self-attention
→