BriefGPT.xyz
May, 2021
Pareto-Optimal Quantized ResNet Is Mostly 4-bit
AmirAli Abdolrashidi, Lisa Wang, Shivani Agrawal, Jonathan Malmaud, Oleg Rybakov...
TL;DR
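The study shows that ResNet models quantized to 4-bit and 8-bit achieve a better compute cost versus accuracy tradeoff curve than bfloat16 ResNet models, with models quantized primarily to 4-bit yielding the best Pareto curve. A 4-bit ResNet-50 trained with quantization-aware training reaches 77.09% accuracy on ImageNet.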
Abstract
Quantization has become a popular technique to compress neural networks and reduce compute cost, but most prior work focuses on studying quantiza…