BitPruning: 学习位长进行激进而精确的量化

Feb, 2020

BitPruning: Learning Bitlengths for Aggressive and Accurate Quantization

Miloš Nikolić, Ghouthi Boukli Hacene, Ciaran Bannon, Alberto Delmas Lascorz, Matthieu Courbariaux...

TL;DR通过提出一种惩罚体系惩罚大位长表示的正则化方法，我们可以在维持准确性的同时，在任意合适的层次上最小化推理位长。

Abstract

neural networks have demonstrably achieved state-of-the art accuracy using low-bitlength integer →