BriefGPT.xyz
Jul, 2019
具有动态感知的无监督技能发现
Dynamics-Aware Unsupervised Discovery of Skills
HTML
PDF
Archit Sharma, Shixiang Gu, Sergey Levine, Vikash Kumar, Karol Hausman
TL;DR
通过结合基于模型的和基于非模型的机器学习方法,本文提出了一种无监督学习算法DADS,用于发现易于预测的行为和学习它们的动态,提高了规划算法的效率和性能。
Abstract
Conventionally,
model-based reinforcement learning
(MBRL) aims to learn a global model for the dynamics of the environment. A good model can potentially enable
planning algorithms
to generate a large variety of b
→