BriefGPT.xyz
May, 2021
协作智能的神经网络特征张量轻量化压缩
Lightweight compression of neural network feature tensors for collaborative intelligence
HTML
PDF
Robert A. Cohen, Hyomin Choi, Ivan V. Bajić
TL;DR
本研究介绍了一种轻量级的压缩技术,用于在边缘设备上进行代码的分割,仅针对深度神经网络中的激活,而且不需要任何重新训练。当应用于流行的对象检测和分类深度神经网络时,能够将32位浮点激活压缩到0.6至0.8位,同时保持精度损失不到1%。
Abstract
In
collaborative intelligence
applications, part of a
deep neural network
(DNN) is deployed on a relatively low-complexity device such as a mobile phone or
→