BriefGPT.xyz
Feb, 2024
带剪裁的随机梯度下降法秘密估计中位数梯度
SGD with Clipping is Secretly Estimating the Median Gradient
HTML
PDF
Fabian Schaipp, Guillaume Garrigos, Umut Simsekli, Robert Gower
TL;DR
基于中位数估计的坚实梯度方法在随机梯度下降算法中能够应对重尾、状态相关性噪声,在分布式学习、隐私约束等领域有广泛应用。本研究在采样、几何中位数计算及迭代中都提出了基于中位数梯度估计的方法,并发现多种已知算法可看作此方法的特例。
Abstract
There are several applications of
stochastic optimization
where one can benefit from a robust estimate of the gradient. For example, domains such as
distributed learning
with corrupted nodes, the presence of larg
→