Deep convolutional neural networks have been widely used in numerous
applications, but their demanding storage and computational resource
requirements prevent their applications on mobile devices. Knowledge
distillation aims to optimize a portable student network by taking the
knowledg