AbstractWith the increasing popularity and the increasing size of
vision transformers (ViTs), there has been an increasing interest in making them more efficient and less computationally costly for deployment on edge devices with limited computing resources.
→