使用稀疏卷积和指导剪枝加速CNN

Aug, 2016

Holistic SparseCNN: Forging the Trident of Accuracy, Speed, and Size

Jongsoo Park, Sheng R. Li, Wei Wen, Hai Li, Yiran Chen...

TL;DR本文提出一种同时实现卷积神经网络的规模经济和速度提升的方法，包括一种有效的一般性稀疏-稠密矩阵乘法实现以及一种性能模型，可以预测不同层和不同计算机架构的稀疏水平的最佳值，该方法可在包括移动设备和超级计算机在内的各种处理器上实现3.1-7.3倍的卷积速度提升。

Abstract

We present Holistic SparseCNN, a sparse convolutional neural network design that simultaneously optimizes convolution layers (for classification speed) and fully connected layers (for model size), while maintaining the accuracy. We directly apply convolutions to tensors without bandwidth-wasting lowering step, which is critical for sparse convolution that is