BriefGPT.xyz
Apr, 2019
具有未知转移模型的确定性 马尔可夫决策过程中高效安全的探索
Efficient and Safe Exploration in Deterministic Markov Decision Processes with Unknown Transition Models
HTML
PDF
Erdem Bıyık, Jonathan Margoliash, Shahrouz Ryan Alimo, Dorsa Sadigh
TL;DR
本文提出一种基于Lipschitz连续性的确定性马尔可夫决策过程未知转移模型的安全探索算法,该算法通过优化减少探索安全空间所需的操作数量,并在导航任务的仿真中与基线方法进行了性能比较。
Abstract
We propose a
safe exploration algorithm
for
deterministic markov decision processes
with unknown transition models. Our algorithm guarantees safety by leveraging
→