BriefGPT.xyz
Oct, 2023
神经网络训练中的离散漂移和平滑正则化
On discretisation drift and smoothness regularisation in neural network training
HTML
PDF
Mihaela Claudia Rosca
TL;DR
通过研究梯度下降算法以及解决离散化漂移问题,从而改善深度学习中的优化和模型正则化,以及探索平滑正则化与优化之间的相互作用。
Abstract
The
deep learning
recipe of casting real-world problems as mathematical
optimisation
and tackling the
optimisation
by training deep neural
→