BriefGPT.xyz
Jul, 2023
基于学习的阈值令牌合并和修剪用于视觉Transformer
Learned Thresholds Token Merging and Pruning for Vision Transformers
HTML
PDF
Maxim Bonnaerens, Joni Dambre
TL;DR
这篇论文介绍了一种名为LTMP的学习阈值符号合并和修剪方法,它通过动态确定合并和修剪的符号,以降低计算视觉变换器所需的输入符号数量,实现了在降低速率的同时保持最先进的准确性,在仅一个微调阶段的情况下比先前的方法快一个数量级以上。
Abstract
vision transformers
have demonstrated remarkable success in a wide range of computer vision tasks over the last years. However, their high
computational costs
remain a significant barrier to their practical deplo
→