BriefGPT.xyz
Nov, 2018
利用目标条件策略学习可操作表示
Learning Actionable Representations with Goal-Conditioned Policies
HTML
PDF
Dibya Ghosh, Abhishek Gupta, Sergey Levine
TL;DR
本文研究功能性显著表征的强化学习方法,可以用于改善稀疏奖励问题的探索、实现具有长期视野的分层强化学习和作为下游任务的学习策略的状态表征。通过在多个虚拟环境中对比实验,表明该方法在表征学习、探索和分层强化学习方面具有优势。
Abstract
representation learning
is a central challenge across a range of machine learning areas. In
reinforcement learning
, effective and functional representations have the potential to tremendously accelerate learning
→