deep neural networks (DNNs) have achieved outstanding performance in a wide
range of applications, e.g., image classification, natural language processing,
etc. Despite the good performance, the huge number of parameters in DNNs brings
challenges to efficient training of DNNs and also