BriefGPT.xyz
Mar, 2024
使用部分海森矩阵的 SGD 优化深度神经网络
SGD with Partial Hessian for Deep Neural Networks Optimization
HTML
PDF
Ying Sun, Hongwei Yong, Lei Zhang
TL;DR
基于二阶算法和Hessian矩阵的优化器SGD-PH在深度神经网络训练中取得了良好的性能。
Abstract
Due to the effectiveness of
second-order algorithms
in solving classical optimization problems, designing second-order optimizers to train
deep neural networks
(DNNs) has attracted much research interest in recen
→