BriefGPT.xyz
Mar, 2019
深度神经网络固定点推断的准确有效的训练量化阈值
Trained Uniform Quantization for Accurate and Efficient Neural Network Inference on Fixed-Point Hardware
HTML
PDF
Sambhav R. Jain, Albert Gural, Michael Wu, Chris Dick
TL;DR
使用标准反向传播和梯度下降法提出了一种对均匀对称量化器进行训练阈值(TQT)的方法,能够以8位量化重新训练不到5次即可在MobileNets等传统难度网络上实现接近浮点精度的分类性能。
Abstract
We propose a method of training quantization clipping thresholds for uniform
symmetric quantizers
using standard
backpropagation
and gradient descent. Our quantizers are constrained to use
→