BriefGPT.xyz
Sep, 2018
Vulcan:用于大量几率受限 MDP 的蒙特卡罗算法,带有风险边界函数
Vulcan: A Monte Carlo Algorithm for Large Chance Constrained MDPs with Risk Bounding Functions
HTML
PDF
Benjamin J Ayton, Brian C Williams
TL;DR
本文提出了一种基于约束和蒙特卡罗树搜索的方法Vulcan,用于解决在可能存在危险结果或未知环境情况下的决策问题,并在计算效率和结果优化度等方面取得了较好的结果。
Abstract
Chance Constrained
markov decision processes
maximize reward subject to a bounded
probability of failure
, and have been frequently applied for planning with potentially dangerous outcomes or unknown environments.
→