BriefGPT.xyz
Nov, 2021
PTQ4ViT:基于双等距量化的视觉Transformer后训练量化框架
PTQ4ViT: Post-Training Quantization Framework for Vision Transformers
HTML
PDF
Zhihang Yuan, Chenhao Xue, Yiqi Chen, Qiang Wu, Guangyu Sun
TL;DR
本文提出了双均匀量化方法和用 Hessian 指导的指标方法来优化视觉转换器上量化的准确度,提出了一个高效的框架 PTQ4ViT,实验证明量化视觉转换器在 ImageNet 分类任务上能够实现接近无损的预测准确度(8 位量化的准确度降低小于 0.5%)。
Abstract
quantization
is one of the most effective methods to compress
neural networks
, which has achieved great success on convolutional
neural networks<
→