BriefGPT.xyz
May, 2023
NeuralMatrix:将整个神经网络转换为通用矩阵乘法以实现高效推断
NeuralMatrix: Moving Entire Neural Networks to General Matrix Multiplication for Efficient Inference
HTML
PDF
Ruiqi Sun, Jie Zhao, Xin He, Yiran Li, An Zou
TL;DR
本研究提出了一种新的框架NeuralMatrix,可以在一个单一的通用矩阵乘法加速器上计算多功能的深度神经网络,同时在性能和特定应用的加速水平方面与CPU和GPU等通用处理器相比具有优势。
Abstract
In this study, we introduce
neuralmatrix
, a novel framework that enables the computation of versatile
deep neural networks
(DNNs) on a single
gen
→