Hongyu Zhu, Mohamed Akrout, Bojian Zheng, Andrew Pelegris, Amar Phanishayee...
TL;DR通过提出一个新的 deep neural networks 训练基准 (TBD),并在三个主要的深度学习框架 (TensorFlow、MXNet、CNTK) 上进行广泛的性能分析,本文为 DNN 训练提供了一套新的分析工具集以及对未来研究和优化的建议。
Abstract
The recent popularity of deep neural networks (DNNs) has generated a lot of research interest in performing DNN-related computation efficiently. However, the primary focus is usually very narrow and limited to (i) inference -- i.e. how to efficiently execute already trained models and