BriefGPT.xyz
Jun, 2019
状态边际匹配优化探索效率
Efficient Exploration via State Marginal Matching
HTML
PDF
Lisa Lee, Benjamin Eysenbach, Emilio Parisotto, Eric Xing, Sergey Levine...
TL;DR
通过 State Marginal Matching (SMM) 框架,将 Reinforcement Learning 中的 Exploration 问题转化为学习策略,以匹配给定目标状态分布。使用 SMM 框架构建的算法,能够更快速地进行探索,较以前的探索方法更快地适应新任务。
Abstract
To solve tasks with sparse rewards,
reinforcement learning
algorithms must be equipped with suitable
exploration
techniques. However, it is unclear what underlying objective is being optimized by existing
→