BriefGPT.xyz
Sep, 2022
图形价值迭代
Graph Value Iteration
HTML
PDF
Dieqiao Feng, Carla P. Gomes, Bart Selman
TL;DR
该论文提出了一种基于图值迭代的领域无关方法,通过利用局部搜索空间的图结构提供更多的信息学习信号,实现了解决规划任务的目标状态,以及通过一种课程策略来平滑学习过程。
Abstract
In recent years,
deep reinforcement learning
(RL) has been successful in various combinatorial search domains, such as two-player games and scientific discovery. However, directly applying deep RL in
planning
dom
→