BriefGPT.xyz
Oct, 2023
神经网络可行的无鞍牛顿优化的Hessian-Vector乘积系列
Series of Hessian-Vector Products for Tractable Saddle-Free Newton Optimisation of Neural Networks
HTML
PDF
Elre T. Oldewage, Ross M. Clarke, José Miguel Hernández-Lobato
TL;DR
提出了一个既能解决大规模的Hessian矩阵问题,又能优化非凸性的优化算法,采用了一个无限级数截断的方法,并在多种情境下进行了验证,包括在CIFAR-10上训练的ResNet-18模型。
Abstract
Despite their popularity in the field of continuous optimisation, second-order
quasi-newton methods
are challenging to apply in machine learning, as the
hessian matrix
is intractably large. This computational bur
→