Nov, 2023
Archtree: 基于实时树结构探索的深度神经网络低延迟裁剪
Archtree: on-the-fly tree-structured exploration for latency-aware pruning of deep neural networks
Rémi Ouazan Reboul, Edouard Yvinec, Arnaud Dapogny, Kevin Bailly
TL;DRArchtree 是一种新的基于延迟驱动的 DNN 结构修剪方法,通过并行地在树形结构中探索多个候选修剪子模型,实时估计目标硬件的延迟,从而更好地适应延迟预算并保持原始模型准确性。