May, 2023

动态稀疏是通道级稀疏的学习器

TL;DR本文提出 Channel-aware dynamic sparse (Chase) 方法:将 unstructured dynamic sparsity 转变为 GPU-friendly channel-level sparsity 加速 inference,通过逐渐去除 biased parameter reallocation across channels,不损失准确率地实现了 1.7 X inference throughput speedup on common GPU devices with ResNet-50 on ImageNet。