Si Yi Meng, Antonio Orvieto, Daniel Yiming Cao, Christopher De Sa
TL;DR: We study gradient descent (GD) dynamics on logistic regression problems with large, constant step sizes.
Abstract
We study gradient descent (GD) dynamics on logistic regression problems with large, constant step sizes. For linearly-separable data, it is known that GD converges to the minimizer with arbitrarily large