BriefGPT.xyz
Nov, 2018
Tight Bayesian Ambiguity Sets for Robust MDPs
Reazul Hasan Russel, Marek Petrik
TL;DR
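This paper proposes RSVF, which addresses the problem that conventional RO-MDP methods compute overly conservative policies. The method uses a Bayesian prior, optimizes the size and location of the ambiguity sets, and relaxes the requirement that the sets be confidence regions, while preserving safety guarantees and practical usefulness.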
Abstract
Robustness is important for sequential decision making in a stochastic dynamic environment with uncertain probabilistic parameters. We address the problem of using robust MDPs (RMDPs) to compute policies with provable w…