BriefGPT.xyz
Jan, 2020
PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Wei Niu, Xiaolong Ma, Sheng Lin, Shihao Wang, Xuehai Qian...
TL;DR
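This work proposes a new dimension of weight pruning, fine-grained pruning patterns inside coarse-grained structures, to execute deep neural networks efficiently on mobile devices. The pruned models are further optimized by a compiler, and the approach is reported to achieve good results.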
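To make the idea of fine-grained pruning inside a coarse-grained structure more concrete, here is a minimal sketch, not the authors' implementation. It assumes 3x3 convolution kernels, a small hypothetical set of masks (`PATTERNS`) that each keep 4 weights, and a selection rule that picks the mask preserving the most squared weight magnitude. All of these choices, and the names `prune_kernel` and `prune_filter`, are illustrative assumptions.

```python
# Hypothetical pattern set: each pattern lists the 4 (row, col) positions
# kept inside a 3x3 kernel. These masks are illustrative assumptions only.
PATTERNS = [
    ((0, 0), (0, 1), (1, 0), (1, 1)),
    ((0, 1), (0, 2), (1, 1), (1, 2)),
    ((1, 0), (1, 1), (2, 0), (2, 1)),
    ((1, 1), (1, 2), (2, 1), (2, 2)),
]


def prune_kernel(kernel):
    """Return (pattern_index, pruned_kernel) for one 3x3 kernel.

    The pattern is chosen (assumed rule) as the one that retains the
    largest sum of squared weights; all other positions are zeroed.
    """
    def kept_energy(pattern):
        return sum(kernel[r][c] ** 2 for r, c in pattern)

    best = max(range(len(PATTERNS)), key=lambda i: kept_energy(PATTERNS[i]))
    keep = set(PATTERNS[best])
    pruned = [
        [kernel[r][c] if (r, c) in keep else 0.0 for c in range(3)]
        for r in range(3)
    ]
    return best, pruned


def prune_filter(kernels):
    """Apply pattern pruning independently to every kernel in a filter."""
    return [prune_kernel(k) for k in kernels]


if __name__ == "__main__":
    example = [
        [0.1, 0.2, 0.0],
        [0.0, 0.9, 0.8],
        [0.0, 0.7, 0.6],
    ]
    index, pruned = prune_kernel(example)
    print("chosen pattern:", index)
    for row in pruned:
        print(row)
```

Because every kernel ends up with one of a few known sparsity shapes, the pattern index can be stored alongside the weights. A compiler could then, in principle, generate specialized code per pattern, which is the kind of regularity the summary above attributes to the approach.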
Abstract
With the emergence of a spectrum of high-end mobile devices, many applications that formerly required desktop-level computation capability are being transferred to these devices. However, executing the inference of deep...