BriefGPT.xyz
Jun, 2020
生态学强化学习
Ecological Reinforcement Learning
HTML
PDF
John D. Co-Reyes, Suvansh Sanjeev, Glen Berseth, Abhishek Gupta, Sergey Levine
TL;DR
本文讨论了针对非情节式、奖励稀疏的强化学习任务中的环境特性,如何应用“环境塑形”和“环境动态性”等方法来提升学习效果,并通过实验验证了这些方法的有效性。
Abstract
Much of the current work on
reinforcement learning
studies episodic settings, where the agent is reset between trials to an initial state distribution, often with well-shaped reward functions.
non-episodic settings
→