May, 2018
Quantizing Convolutional Neural Networks for Low-Power High-Throughput Inference Engines
Sean O. Settle, Manasa Bollavaram, Paolo D'Alberto, Elliott Delaye, Oscar Fernandez...
TL;DR
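This paper proposes a quantization scheme whose parameters are determined by calibrating against a reference floating-point model rather than by retraining. This allows inference to run on more efficient arithmetic, and the end-to-end accuracy of the quantized model is comparable to that of the baseline model.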
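To make the idea of calibration without retraining concrete, here is a minimal sketch in plain Python. It is not the paper's scheme: it assumes a simple symmetric 8-bit format whose scale is chosen from the largest magnitude seen in calibration data, and the helper names (`calibrate_scale`, `quantize`, `quantized_dot`) are hypothetical, introduced only for illustration.

```python
# Illustrative sketch only: symmetric 8-bit quantization with a scale
# calibrated from observed values. This is an assumed, simplified scheme,
# not the method described in the paper.

def calibrate_scale(samples, num_bits=8):
    """Choose a scale so the largest observed magnitude maps to the top integer level."""
    qmax = 2 ** (num_bits - 1) - 1  # 127 for 8 bits
    max_abs = max(abs(x) for x in samples)
    return max_abs / qmax if max_abs > 0 else 1.0


def quantize(values, scale, num_bits=8):
    """Round each value to the nearest integer level and clamp to the signed range."""
    qmax = 2 ** (num_bits - 1) - 1
    return [max(-qmax, min(qmax, round(v / scale))) for v in values]


def dequantize(q_values, scale):
    """Map integer levels back to approximate real values."""
    return [q * scale for q in q_values]


def quantized_dot(weights, activations, w_scale, a_scale):
    """Dot product accumulated in integers, rescaled to a real value at the end."""
    qw = quantize(weights, w_scale)
    qa = quantize(activations, a_scale)
    acc = sum(w * a for w, a in zip(qw, qa))  # integer multiply-accumulate
    return acc * w_scale * a_scale


# Calibration uses values taken from the floating-point model; no training step.
weights = [0.5, -1.0, 0.25]
activations = [2.0, 0.5, -4.0]
w_scale = calibrate_scale(weights)
a_scale = calibrate_scale(activations)

reference = sum(w * a for w, a in zip(weights, activations))
approx = quantized_dot(weights, activations, w_scale, a_scale)
print(reference, approx)
```

The inner loop works only on small integers, which is where the efficiency gain comes from. The scales are derived from the floating-point values alone, which is the sense in which this sketch calibrates rather than retrains.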
Abstract
Deep learning as a means to inferencing has proliferated thanks to its versatility and ability to approach or exceed human-level accuracy. These computational models have seemingly insatiable appetites for comput