This paper introduces a novel approach to enhance the performance of the stochastic gradient descent (SGD) algorithm by incorporating a modified decay step size based on $\frac{1}{\sqrt{t}}$. The proposed step size integrates a logarithmic term, leading to the selection of smaller values in the final iterations. Our analysis establishes a convergence rate of $O(\frac{\ln T}{\sqrt{T}})$ for smooth non-convex functions without the Polyak-{\L}ojasiewicz condition. To evaluate the effectiveness of our approach, we conducted numerical experiments on image classification tasks using the FashionMNIST, and CIFAR10 datasets, and the results demonstrate significant improvements in accuracy, with enhancements of $0.5\%$ and $1.4\%$ observed, respectively, compared to the traditional $\frac{1}{\sqrt{t}}$ step size. The source code can be found at \\\url{https://github.com/Shamaeem/LNSQRTStepSize}.

该论文提出了一种新颖的方法，通过引入基于1/√t的修改衰减步长来提高随机梯度下降(SGD)算法的性能。所提出的步长整合了对数项，在最后的迭代中选择较小的值。通过分析，我们在非凸光滑函数无Polyak-Lojasiewicz条件的情况下，建立了收敛速度为O(ln T/√T)。为了评估我们的方法的有效性，我们在FashionMNIST和CIFAR10数据集上进行了图像分类任务的数值实验，结果显示与传统的1/√t步长相比，准确率明显提高，分别观察到0.5%和1.4%的增益。源代码可以在https://github.com/Shamaeem/LNSQRTStepSize找到。

增强型随机梯度下降算法的改进步长：收敛性和实验