Abstract

Despite the recent success in many applications, the high computational requirements of
vision transformers limit their use in resource-constrained settings. While many existing methods reduce the quadratic complexity of attention, in most