BriefGPT.xyz
Mar, 2023
使用目标条件策略模拟基于图的规划
Imitating Graph-Based Planning with Goal-Conditioned Policies
HTML
PDF
Junsu Kim, Younggyo Seo, Sungsoo Ahn, Kyunghwan Son, Jinwoo Shin
TL;DR
该论文提出了一种基于图形规划算法和自我模仿的方法,通过提取子目标策略来优化目标目标策略,从而提高在长期任务中实现指定目标的样本效率。
Abstract
Recently,
graph-based planning algorithms
have gained much attention to solve
goal-conditioned reinforcement learning
(RL) tasks: they provide a sequence of subgoals to reach the target-goal, and the agents learn
→