BriefGPT.xyz
Jun, 2023
深度强化学习中的优化器重置:实证研究
Resetting the Optimizer in Deep RL: An Empirical Study
HTML
PDF
Kavosh Asadi, Rasool Fakoor, Shoham Sabach
TL;DR
本研究旨在研究在深度强化学习中近似于最优值函数的问题。通过重置优化器的内部参数,可以提高模型在Atari测试中的表现。
Abstract
We focus on the task of approximating the optimal value function in
deep reinforcement learning
. This iterative process is comprised of approximately solving a sequence of
optimization
problems where the objectiv
→