BriefGPT.xyz
Oct, 2018
梯度下降优化在策略梯度方法中的实证分析:我的最优解去哪了?
Where Did My Optimum Go?: An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods
HTML
PDF
Peter Henderson, Joshua Romoff, Joelle Pineau
TL;DR
本论文研究不同的梯度下降优化方法对深度强化学习的影响,并发现适应性优化器有一个有效学习率的狭窄窗口,同时动量的有效性会因环境属性而异,为深度强化学习算法的优化提供了新的思路和建议。
Abstract
Recent analyses of certain
gradient descent optimization
methods have shown that performance can degrade in some settings - such as with stochasticity or implicit momentum. In
deep reinforcement learning
(Deep RL
→