BriefGPT.xyz
Oct, 2021
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer
Sachin Mehta, Mohammad Rastegari
TL;DR
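This paper proposes MobileViT, a light-weight, general-purpose vision transformer for mobile devices that treats transformers as convolutions. It performs better than CNNs and ViTs, especially on object detection tasks.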
Abstract
Light-weight convolutional neural networks (CNNs) are the de-facto for mobile vision tasks. Their spatial inductive biases allow them to learn representations with fewer parameters across different vision tasks.