BriefGPT.xyz
May, 2017
层次强化学习中的特征控制作为内在动机
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
HTML
PDF
Nat Dilokthanakul, Christos Kaplanis, Nick Pawlowski, Murray Shanahan
TL;DR
本文介绍了一种通用的子目标类别,应用于端到端层次强化学习系统中,可用于处理含有稀疏奖励的Montezuma的复仇等Atari游戏。该方法引入了一组时间扩展行动,或选项,以及对应的子目标。
Abstract
The problem of
sparse rewards
is one of the hardest challenges in contemporary
reinforcement learning
. Hierarchical
reinforcement learning
→