BriefGPT.xyz
Feb, 2022
F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization
Qing Jin, Jian Ren, Richard Zhuang, Sumant Hanumante, Zhengang Li...
TL;DR
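F8Net is a quantization framework built entirely on fixed-point 8-bit multiplication. It narrows the performance gap between quantized neural network models and their full-precision counterparts, and significantly reduces memory footprint and energy consumption.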
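To make "fixed-point 8-bit multiplication" concrete, here is a minimal sketch of multiplying two signed 8-bit fixed-point numbers using integer arithmetic only. It is an illustration of generic fixed-point arithmetic, not the paper's implementation: the helper names (`to_fixed`, `to_float`, `fixed_mul`) and the choice of fractional bit counts are assumptions made for this example, and how F8Net actually selects fractional lengths is not described in this summary.

```python
# Illustrative sketch only: generic signed 8-bit fixed-point arithmetic.
# All function names and the fractional-bit choices are hypothetical.

INT8_MIN, INT8_MAX = -128, 127


def to_fixed(x: float, frac_bits: int) -> int:
    """Quantize x to a signed 8-bit integer with `frac_bits` fractional bits."""
    q = round(x * (1 << frac_bits))          # scale by 2**frac_bits, round to nearest
    return max(INT8_MIN, min(INT8_MAX, q))   # saturate to the int8 range


def to_float(q: int, frac_bits: int) -> float:
    """Recover the real value represented by a fixed-point integer."""
    return q / (1 << frac_bits)


def fixed_mul(a: int, b: int, fa: int, fb: int, f_out: int) -> int:
    """Multiply two 8-bit fixed-point values and requantize to 8 bits.

    The raw integer product carries fa + fb fractional bits; a bit shift
    rescales it to f_out fractional bits, so no floating-point scale is used.
    """
    prod = a * b                              # fits in 16 bits for int8 inputs
    shift = fa + fb - f_out
    if shift > 0:
        prod = (prod + (1 << (shift - 1))) >> shift  # add half, then shift right
    else:
        prod <<= -shift
    return max(INT8_MIN, min(INT8_MAX, prod))  # saturate the result to int8


a = to_fixed(1.5, 4)     # 1.5 with 4 fractional bits
b = to_fixed(-0.75, 6)   # -0.75 with 6 fractional bits
c = fixed_mul(a, b, 4, 6, 5)
print(a, b, c, to_float(c, 5))
```

In this example both operands and the result stay within 8 bits, and the only rescaling step is a shift, which is the property that makes fixed-point arithmetic attractive on integer-only hardware.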
Abstract
Neural network quantization is a promising compression technique to reduce memory footprint and save energy consumption, potentially leadi