BriefGPT.xyz
Mar, 2022
量化感知训练中克服振荡问题
Overcoming Oscillations in Quantization-Aware Training
HTML
PDF
Markus Nagel, Marios Fournarakis, Yelysei Bondarenko, Tijmen Blankevoort
TL;DR
本文研究神经网络的量化问题,发现在低比特率下,深度可分离网络(如MobileNets,EfficientNets)量化训练中,量化权重可能出现意外震荡,导致在推断过程中统计错误、在训练过程中增加噪声,进而显著降低准确性。作者提出了两种新的QAT算法,分别是自适应调节震荡和迭代冻结权重,相较已有算法都表现出了更好的效果。
Abstract
When training
neural networks
with simulated
quantization
, we observe that quantized weights can, rather unexpectedly, oscillate between two grid-points. The importance of this effect and its impact on
→