BriefGPT.xyz
Mar, 2022
面向视觉Transformer的补丁相似度感知无数据量化
Patch Similarity Aware Data-Free Quantization for Vision Transformers
HTML
PDF
Zhikai Li, Liping Ma, Mengjuan Chen, Junrui Xiao, Qingyi Gu
TL;DR
提出了PSAQ-ViT,这是一种基于自注意力模块的Patch Similarity Aware数据无关量化框架,可以通过生成“逼真”样本来校准量化参数,从而实现Vision transformers在资源受限设备上的部署。
Abstract
vision transformers
have recently gained great success on various computer vision tasks; nevertheless, their high model complexity makes it challenging to deploy on resource-constrained devices.
quantization
is a
→