BriefGPT.xyz
Nov, 2023
多模型深度学习推理流水线的自动异构低比特量化
Automated Heterogeneous Low-Bit Quantization of Multi-Model Deep Learning Inference Pipeline
HTML
PDF
Jayeeta Mondal, Swarnava Dey, Arijit Mukherjee
TL;DR
该论文介绍了一种自动异构量化方法,用于具有多个深度神经网络的深度学习推理流水线。
Abstract
Multiple
deep neural networks
(DNNs) integrated into single
deep learning
(DL) inference pipelines e.g.
multi-task learning
(MTL) or
→