TL;DR本文提出 Deep Incubation 训练方法,将大型深度学习模型分为互相连接的子模块进行训练,并经过实验证明在训练效率和准确率方面优于 end-to-end 训练方法。
Abstract
Recent years have witnessed a remarkable success of large deep learning
models. However, training these models is challenging due to high computational
costs, painfully slow convergence, and overfitting issues. I