The mean-field Langevin dynamics (MFLD) is a nonlinear generalization of the
Langevin dynamics that incorporates a distribution-dependent drift, and it
naturally arises from the optimization of two-layer neural networks via (noisy)
gradient descent. Recent works have shown that MFLD globally minimizes an
entropy-regularized convex functional in the space of measures. However, all
prior analyses assumed the infinite-particle or continuous-time limit, and
cannot handle stochastic gradient updates. We provide an general framework to
prove a uniform-in-time propagation of chaos for MFLD that takes into account
the errors due to finite-particle approximation, time-discretization, and
stochastic gradient approximation. To demonstrate the wide applicability of
this framework, we establish quantitative convergence rate guarantees to the
regularized global optimal solution under (i) a wide range of learning problems
such as neural network in the mean-field regime and MMD minimization, and (ii)
different gradient estimators including SGD and SVRG. Despite the generality of
our results, we achieve an improved convergence rate in both the SGD and SVRG
settings when specialized to the standard Langevin dynamics.

本文提出了一个新的框架来证明具有有限粒子逼近，时间离散化和随机梯度逼近误差的 MFLD 的混沌传播具有时间一致性，并在学习问题和不同梯度估计器的广泛范围内建立了量化的收敛速率保证，包括 SGD 和 SVRG 算法。

均场 Langevin 动力学的收敛性：时间和空间离散化、随机梯度和方差缩减

Convergence of mean-field Langevin dynamics: Time and space  discretization, stochastic gradient, and variance reduction

The recent mean field game (MFG) formalism facilitates otherwise intractable
computation of approximate Nash equilibria in many-agent settings. In this
paper, we consider discrete-time finite MFGs subject to finite-horizon
objectives. We show that all discrete-time finite MFGs with non-constant fixed
point operators fail to be contractive as typically assumed in existing MFG
literature, barring convergence via fixed point iteration. Instead, we
incorporate entropy-regularization and Boltzmann policies into the fixed point
iteration. As a result, we obtain provable convergence to approximate fixed
points where existing methods fail, and reach the original goal of approximate
Nash equilibria. All proposed methods are evaluated with respect to their
exploitability, on both instructive examples with tractable exact solutions and
high-dimensional problems where exact methods become intractable. In
high-dimensional scenarios, we apply established deep reinforcement learning
methods and empirically combine fictitious play with our approximations.

本文研究了离散时间有限 MFG 问题，通过使用熵正则化和 Boltzmann 策略使得固定点迭代收敛到近似固定点，同时提供了在高维场景下使用的近似 Nash 均衡算法以及结合虚拟博弈的深度强化学习方法。