BriefGPT.xyz
Jun, 2024
循环稀疏训练:足够吗?
Cyclic Sparse Training: Is it Enough?
HTML
PDF
Advait Gadhikar, Sree Harsha Nelaturu, Rebekka Burkholz
TL;DR
通过重复周期性训练,我们提出了SCULPT-ing方法,即通过稀疏掩膜的重复周期性训练,然后进行单次剪枝步骤以耦合参数和掩膜,从而在减少计算成本的同时,在高稀疏度条件下达到与最先进的迭代剪枝方法相匹配的性能。
Abstract
The success of
iterative pruning methods
in achieving state-of-the-art
sparse networks
has largely been attributed to improved mask identification and an implicit regularization induced by pruning. We challenge t
→