BriefGPT.xyz
Sep, 2018
通过策略梯度优化手动设计的符号接地和高层规划
Refining Manually-Designed Symbol Grounding and High-Level Planning by Policy Gradients
HTML
PDF
Takuya Hiraoka, Takashi Onishi, Takahisa Imagawa, Yoshimasa Tsuruoka
TL;DR
该论文提出了一种自动细化符号接地函数和高层规划器以减少人工设计这些模块的人工工作量的框架,利用基于手动设计知识库的半马尔科夫决策过程建模符号接地和高层规划,并应用策略梯度方法来细化这些模块以产生适当的可解释规划。
Abstract
hierarchical planners
that produce interpretable and appropriate plans are desired, especially in its application to supporting human decision making. In the typical development of the
hierarchical planners
, high
→