关于无通信延迟的异步随机逼近稳定性的注记

Dec, 2023

A Note on Stability in Asynchronous Stochastic Approximation without Communication Delays

Huizhen Yu, Yi Wan, Richard S. Sutton

TL;DR本文研究没有通信延迟的异步随机逼近算法，主要贡献是通过扩展Borkar和Meyn的方法来进行这些算法的稳定性证明，我们还从稳定性结果中导出收敛性结果，并讨论其在重要的平均奖励强化学习问题中的应用。

Abstract

In this paper, we study asynchronous stochastic approximation algorithms without communication delays. Our main contribution is a stability proof