BriefGPT.xyz
May, 2024
奥卡姆梯度下降
Occam Gradient Descent
HTML
PDF
B. N. Kausik
TL;DR
通过应用学习理论,我们提出了Occam梯度下降算法,同时降低神经网络的拓扑结构大小和权重,从而在准确度、计算和模型压缩方面优于传统梯度下降算法。
Abstract
deep learning neural network models
must be large enough to adapt to their problem domain, while small enough to avoid
overfitting
training data during gradient descent. To balance these competing demands, overpr
→