BriefGPT.xyz
Jul, 2020
目标感知预测:学习如何模拟重要因素
Goal-Aware Prediction: Learning to Model What Matters
HTML
PDF
Suraj Nair, Silvio Savarese, Chelsea Finn
TL;DR
该论文提出了一种基于自监督学习的学习动力学模型,该模型可用于任务规划和策略学习,避免了视觉控制任务中由于真实环境的复杂度超过模型容量所导致的训练效率低的问题。
Abstract
learned dynamics models
combined with both
planning
and
policy learning algorithms
have shown promise in enabling artificial agents to lea
→