BriefGPT.xyz
Sep, 2023
基于梯度分位数截断的鲁棒随机优化
Robust Stochastic Optimization via Gradient Quantile Clipping
HTML
PDF
Ibrahim Merad, Stéphane Gaïffas
TL;DR
引入了一种剪裁策略,使用梯度范数的分位数作为剪裁阈值,为平滑目标(凸或非凸)提供鲁棒且高效的优化算法,容忍重尾样本和数据中的异常值,数学分析说明了其收敛性质以及对初始估计误差的高概率界限,并通过实验证实了其高效性和鲁棒性。
Abstract
We introduce a
clipping strategy
for
stochastic gradient descent
(SGD) which uses quantiles of the gradient norm as clipping thresholds. We prove that this new strategy provides a robust and efficient
→