BriefGPT.xyz
Sep, 2017
基于低内存GEMM的深度神经网络卷积算法
Low-memory GEMM-based convolution algorithms for deep neural networks
HTML
PDF
Andrew Anderson, Aravind Vasudevan, Cormac Keane, David Gregg
TL;DR
本文提出两种新型基于GEMM的算法,分别只需要额外的O(MHW)和O(KW)的空间,显著降低了DNN卷积的空间开销,适用于内存受限的嵌入式系统,并且实验表明我们的低内存算法和最好的图案构建方法一样快,尽管需要的额外内存只相当于后者的一小部分。
Abstract
deep neural networks
(DNNs) require very large amounts of computation both for training and for inference when deployed in the field. A common approach to implementing DNNs is to recast the most computationally expensive operations as
→