BriefGPT.xyz
Oct, 2020
一种深度神经网络低比特宽度训练的统计框架
A Statistical Framework for Low-bitwidth Training of Deep Neural Networks
HTML
PDF
Jianfei Chen, Yu Gai, Zhewei Yao, Michael W. Mahoney, Joseph E. Gonzalez
TL;DR
本论文提出了一个用于分析全量化训练算法的统计框架,并探讨了梯度量化对其收敛性的影响。作者开发了两个新的梯度量化器,并展示了这些量化器相对于现有的每个张量量化器具有更小的方差。
Abstract
fully quantized training
(FQT), which uses low-bitwidth hardware by quantizing the activations, weights, and gradients of a neural network model, is a promising approach to accelerate the training of
deep neural network
→