BriefGPT.xyz
Mar, 2021
Bit-Mixer: 运行时位宽选择的混合精度网络
Bit-Mixer: Mixed-precision networks with runtime bit-width selection
HTML
PDF
Adrian Bulat, Georgios Tzimiropoulos
TL;DR
本文提出了 Bit-Mixer 方法,为高度精准预测训练多量化层的混合精度网络,在测试期间任何层都可以改变自己的比特宽度,并通过“转换批量归一化”和3阶段优化,展示了网络的训练过程以及具有理想的灵活性属性的混合精度网络可供设备部署,不会影响推断准确度。
Abstract
mixed-precision networks
allow for a
variable bit-width quantization
for every layer in the network. A major limitation of existing work is that the bit-width for each layer must be predefined during training tim
→