BriefGPT.xyz
Sep, 2018
深度强化学习中的确定性实现,实现可重复性
Deterministic Implementations for Reproducibility in Deep Reinforcement Learning
HTML
PDF
Prabhat Nagarajan, Garrett Warnell, Peter Stone
TL;DR
本文研究了深度强化学习中训练的不确定性问题,并通过确定性实现来控制其表现差异,实验结果表明确定性实现能有效提高智能体的性能表现,并且对于结果的精确复现也具有重要作用。
Abstract
While
deep reinforcement learning
(DRL) has led to numerous successes in recent years, reproducing these successes can be extremely challenging. One reproducibility challenge particularly relevant to DRL is
nondetermini
→