BriefGPT.xyz
Oct, 2021
Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization
Weihan Chen, Peisong Wang, Jian Cheng
TL;DR
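The paper formulates mixed-precision quantization of deep neural networks as a discrete constrained optimization problem and uses a second-order Taylor expansion to derive an efficient algorithm for solving it. On the ImageNet dataset and across a variety of network architectures, it reports better results than existing methods.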
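To make the idea concrete, here is a minimal sketch of bit-width allocation as a discrete constrained problem with a second-order cost. It is not the paper's algorithm: the uniform symmetric quantizer, the diagonal Hessian approximation, the exhaustive search, and every name in it (`quantize`, `second_order_cost`, `allocate_bits`, the toy layers and budget) are assumptions chosen for illustration.

```python
import itertools
import numpy as np


def quantize(w, bits):
    """Uniform symmetric quantization (an assumed quantizer, bits >= 2)."""
    assert bits >= 2
    levels = 2 ** (bits - 1) - 1
    max_abs = np.max(np.abs(w))
    if max_abs == 0:
        return w.copy()
    scale = max_abs / levels
    return np.round(w / scale) * scale


def second_order_cost(w, h_diag, bits):
    """Second-order Taylor term 0.5 * dw^T H dw, with H assumed diagonal."""
    dw = quantize(w, bits) - w
    return 0.5 * float(np.sum(h_diag * dw * dw))


def allocate_bits(weights, hessians, choices, budget_bits):
    """Exhaustively pick one bit-width per layer under a total size budget."""
    best_assign, best_cost = None, float("inf")
    for assign in itertools.product(choices, repeat=len(weights)):
        # Constraint: total storage in bits must not exceed the budget.
        size = sum(b * w.size for b, w in zip(assign, weights))
        if size > budget_bits:
            continue
        cost = sum(
            second_order_cost(w, h, b)
            for w, h, b in zip(weights, hessians, assign)
        )
        if cost < best_cost:
            best_assign, best_cost = assign, cost
    return best_assign, best_cost


# Toy setup (hypothetical): a small sensitive layer and a larger, less
# sensitive one, with a budget equal to 4 bits per weight on average.
rng = np.random.default_rng(0)
weights = [rng.normal(size=64), rng.normal(size=256)]
hessians = [np.full(64, 10.0), np.full(256, 0.1)]
budget = 4 * (64 + 256)
assignment, cost = allocate_bits(weights, hessians, (2, 4, 8), budget)
```

Exhaustive search grows exponentially with the number of layers, so it only works for toy cases like this one; it is used here purely to show the objective and the constraint side by side.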
Abstract
Quantization is a widely used technique to compress and accelerate deep neural networks. However, conventional quantization methods use th…