BriefGPT.xyz
Dec, 2021
AdaViT: Adaptive Tokens for Efficient Vision Transformer
Hongxu Yin, Arash Vahdat, Jose Alvarez, Arun Mallya, Jan Kautz...
TL;DR
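Proposes A-ViT, a method that adaptively adjusts the inference cost of vision transformers (ViT). Built on a reformulation of Adaptive Computation Time (ACT), it automatically reduces the number of tokens processed in the network, without modifying the network architecture or the inference hardware, and reports significant improvements on image classification.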
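The token-reduction idea can be illustrated with a minimal sketch of an ACT-style stopping rule. This is not the authors' implementation: `halting_score`, `toy_layer`, the parameters `gamma`, `beta`, `eps`, and the choice to derive the score from the first embedding dimension are all assumptions made for illustration. Each token accumulates a halting score layer by layer and is dropped from later layers once the running sum reaches `1 - eps`.

```python
import math


def halting_score(token, gamma=5.0, beta=-2.0):
    """Hypothetical halting score: a sigmoid over the first embedding dimension."""
    return 1.0 / (1.0 + math.exp(-(gamma * token[0] + beta)))


def toy_layer(token):
    """Stand-in for a transformer block (assumption: a simple per-token update)."""
    return [0.9 * x + 0.1 for x in token]


def adaptive_forward(tokens, num_layers=12, eps=0.01):
    """Return how many layers each token passes through before it halts.

    A token stays active until its cumulative halting score reaches 1 - eps,
    which is the ACT-style stopping rule this sketch assumes.
    """
    tokens = [list(t) for t in tokens]   # copy so the caller's data is untouched
    cumulative = [0.0] * len(tokens)     # running sum of halting scores per token
    depth = [0] * len(tokens)            # number of layers applied to each token
    active = list(range(len(tokens)))    # indices of tokens still being processed

    for _ in range(num_layers):
        if not active:
            break
        still_active = []
        for i in active:
            tokens[i] = toy_layer(tokens[i])
            depth[i] += 1
            cumulative[i] += halting_score(tokens[i])
            if cumulative[i] < 1.0 - eps:
                still_active.append(i)   # not halted yet, keep for the next layer
        active = still_active
    return depth


if __name__ == "__main__":
    # One token that should halt early and one that should run all layers.
    print(adaptive_forward([[2.0, 0.0], [-5.0, 0.0]]))
```

In this toy setup the compute saved is simply the gap between `sum(depth)` and `num_layers * len(tokens)`; a real model would learn the halting scores jointly with the task loss.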
Abstract
We introduce AdaViT, a method that adaptively adjusts the inference cost of vision transformer (ViT) for images of different complexity. AdaViT achieves this by automatically reducing the number of tokens in vision tran...