BriefGPT.xyz
Oct, 2016
Stochastic Gradient MCMC with Stale Gradients
Changyou Chen, Nan Ding, Chunyuan Li, Yizhe Zhang, Lawrence Carin
TL;DR
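How computing stochastic gradients from stale parameters affects the convergence of SG-MCMC was previously unknown. Our theory shows that staleness affects only the bias and the mean squared error, while the estimation variance is independent of staleness. In distributed systems this gives a degree of scalability, with a linear speedup in reducing the variance.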
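To make "stale gradients" concrete, here is a minimal sketch, not the authors' algorithm or experimental setup. It assumes stochastic gradient Langevin dynamics (SGLD) as the SG-MCMC instance, a fixed staleness of `staleness` steps, and a toy one-dimensional Gaussian target. The names `sgld_with_stale_gradients`, `toy_noisy_grad`, and `TARGET_MEAN` are hypothetical and chosen only for illustration.

```python
import math
import random
from collections import deque

TARGET_MEAN = 2.0  # assumption: toy target is N(TARGET_MEAN, 1)


def toy_noisy_grad(theta, rng):
    """Noisy gradient of U(theta) = (theta - TARGET_MEAN)^2 / 2.

    The added Gaussian term stands in for minibatch noise (an assumption).
    """
    return (theta - TARGET_MEAN) + 0.1 * rng.gauss(0.0, 1.0)


def sgld_with_stale_gradients(grad_u, theta0, step, n_steps, staleness, seed=0):
    """Run SGLD where each gradient is evaluated at a parameter value
    that is `staleness` steps old (clamped to theta0 early on)."""
    rng = random.Random(seed)
    # history[0] is always the oldest retained parameter, i.e. the stale one.
    history = deque([theta0], maxlen=staleness + 1)
    theta = theta0
    samples = []
    for _ in range(n_steps):
        stale_theta = history[0]
        grad = grad_u(stale_theta, rng)
        # Langevin update: gradient step at the stale point plus injected noise.
        theta = theta - step * grad + math.sqrt(2.0 * step) * rng.gauss(0.0, 1.0)
        history.append(theta)
        samples.append(theta)
    return samples


def posterior_mean_estimate(samples, burn_in):
    """Average the samples after discarding a burn-in prefix."""
    kept = samples[burn_in:]
    return sum(kept) / len(kept)
```

Calling `sgld_with_stale_gradients(toy_noisy_grad, 0.0, 0.02, 50000, staleness=5)` and comparing `posterior_mean_estimate` against a run with `staleness=0` gives a rough feel for how stale gradients change the estimate on this toy problem; it does not reproduce the paper's bounds.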
Abstract
Stochastic gradient MCMC (SG-MCMC) has played an important role in large-scale Bayesian learning, with well-developed theoretical convergence pro…