BriefGPT.xyz
Jun, 2019
信息受限基元的竞争性集成强化学习
Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives
HTML
PDF
Anirudh Goyal, Shagun Sodhani, Jonathan Binas, Xue Bin Peng, Sergey Levine...
TL;DR
通过信息理论机制,提出了一种基于分解原语的策略输入的设计方法,这种方法比扁平和分层策略的泛化性都更好。
Abstract
reinforcement learning
agents that operate in diverse and complex environments can benefit from the structured decomposition of their behavior. Often, this is addressed in the context of hierarchical
reinforcement learn
→