BriefGPT.xyz
Feb, 2023
理解每步回放不同数量的影响
Understanding the effect of varying amounts of replay per step
HTML
PDF
Animesh Kumar Paul, Videh Raj Nema
TL;DR
本研究从经验重放和模型的角度出发,对Deep Q-Network算法中回放量的变化对样本效率和算法健壮性的影响进行了系统性研究,在Mountain Car环境下获得了提高样本效率、降低性能波动、提高算法鲁棒性的结果,为算法应用方面提供了新的思路。
Abstract
model-based reinforcement learning
uses models to plan, where the predictions and policies of an agent can be improved by using more computation without additional data from the environment, thereby improving
sample eff
→