BriefGPT.xyz
May, 2018
Fast Online Exact Solutions for Deterministic MDPs with Sparse Rewards
Joshua R. Bertram, Xuxi Yang, Peng Wei
TL;DR
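Introduces a new method that exactly and efficiently solves deterministic, continuous MDPs with sparse reward sources. The method reduces computational complexity and could make such MDPs more practical in fields such as robotics and unmanned systems.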
Abstract
Markov decision processes (MDPs) are a mathematical framework for modeling sequential decision making under uncertainty. The classical app…