BriefGPT.xyz
Aug, 2017
非凸机器学习的二阶优化:一个经验性研究
Second-Order Optimization for Non-Convex Machine Learning: An Empirical Study
HTML
PDF
Peng Xu, Farbod Roosta-Khorasani, Michael W. Mahoney
TL;DR
本文研究了一类基于牛顿方法的优化算法在非凸机器学习问题中的应用,展示了其可以更好地利用曲率信息来逃离平坦区域和鞍点,并在泛化性能方面表现相当于或优于手动调整学习率的随机梯度下降算法。
Abstract
The resurgence of deep learning, as a highly effective
machine learning
paradigm, has brought back to life the old optimization question of non-convexity. Indeed, the challenges related to the large-scale nature of many modern
→