通过修剪激活梯度加速CNN训练

Aug, 2019

Accelerating CNN Training by Sparsifying Activation Gradients

Xucheng Ye, Jianlei Yang, Pengcheng Dai, Yiran Chen, Weisheng Zhao

TL;DR通过修剪更小的梯度和考虑激活梯度的统计分布，我们提出了一种方法来加速CNN训练，这将不会影响准确率。

Abstract

Gradients to activations get involved in most of the calculations during back propagation procedure of Convolution Neural Networks (CNNs) training. However, an important known observation is that the majority of these gradients are close to zero, imposing little impact on weights updat