Qihan Wang, Chen Dun, Fangshuo Liao, Chris Jermaine, Anastasios Kyrillidis
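TL;DR: This paper studies the Lottery Ticket Hypothesis and proposes LOttery ticket through Filter-wise Training, a pretraining algorithm for convolutional neural networks that identifies good filters and reduces the memory and communication costs of pretraining while maintaining or even improving accuracy.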
Abstract
Recent work on the lottery ticket hypothesis (LTH) shows that there exist ``\textit{winning tickets}'' in large neural networks. These tickets represent ``sparse'' versions of the full model that can be trained i