BriefGPT.xyz
Mar, 2022
强化学习中利用Rényi状态熵加速探索
Rényi State Entropy for Exploration Acceleration in Reinforcement Learning
HTML
PDF
Mingqi Yuan, Man-on Pun, Dong Wang
TL;DR
为解决深度强化学习中的长期探索能力问题,本文提出了一种基于Rényi熵的新型内在奖励模块,并通过较广泛的模拟结果证明了其高于现有方案的性能。
Abstract
One of the most critical challenges in
deep reinforcement learning
is to maintain the long-term
exploration
capability of the agent. To tackle this problem, it has been recently proposed to provide
→