早停法是非参数变分推断

Apr, 2015

Early Stopping is Nonparametric Variational Inference

Dougal Maclaurin, David Duvenaud, Ryan P. Adams

TL;DR本研究使用非参数变分近似后验分布的样本抽取来解释随机梯度下降，为基于最小下限的对数边际似然的超参数优化提供一种输出，包括神经网络等领域。

Abstract

We show that unconverged stochastic gradient descent can be interpreted as a procedure that samples from a nonparametric variational approximate posterior distribution. This distribution is implicitly defined as the transformation of an initial distribution by a sequence of optimizatio