BriefGPT.xyz
Dec, 2020
混合匹配:一种新的面向 FPGA 的深度神经网络量化框架
Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework
HTML
PDF
Sung-En Chang, Yanyu Li, Mengshu Sun, Runbin Shi, Hayden K. -H. So...
TL;DR
该论文研究了基于FPGA的深度神经网络模型压缩方法——不同行采用不同的量化方案以充分利用FPGA中LUT和DSP的资源,提出了适用于高斯分布和均匀分布的两种量化方案,并提出了混合方案以保持或提高精度。
Abstract
deep neural networks
(
dnns
) have achieved extraordinary performance in various application domains. To support diverse DNN models, efficient implementations of DNN inference on edge-computing platforms, e.g., ASI
→