BriefGPT.xyz
Oct, 2022
任务分阶段:从示范中自动学习课程
Task Phasing: Automated Curriculum Learning from Demonstrations
HTML
PDF
Vaibhav Bajaj, Guni Sharon, Peter Stone
TL;DR
本文介绍了一种基于任务分阶段的机器学习方法,通过逐步提高任务复杂度并调节反馈信息,针对稀疏奖励问题下的强化学习进行探索,并取得了较好成果。
Abstract
Applying
reinforcement learning
(RL) to
sparse reward domains
is notoriously challenging due to insufficient guiding signals. Common techniques for addressing such domains include (1) learning from demonstrations
→