BriefGPT.xyz
May, 2024
自我先见力:智能体视觉动作预测作为强化学习的规范化方法
Ego-Foresight: Agent Visuomotor Prediction as Regularization for RL
HTML
PDF
Manuel S. Nunes, Atabak Dehban, Yiannis Demiris, José Santos-Victor
TL;DR
以运动预测为基础的自我监督方法 Ego-Foresight 可提高强化学习算法的效果和性能。
Abstract
Despite the significant advancements in
deep reinforcement learning
(RL) observed in the last decade, the amount of
training experience
necessary to learn effective policies remains one of the primary concerns bo
→