knowledge distillation transfers the knowledge from a cumbersome teacher to a
small student. Recent results suggest that the student-friendly teacher is more
appropriate to distill since it provides more transferable knowledge. In this
work, we propose the novel framework, "prune, then