BriefGPT.xyz
Feb, 2023
间歇可观察的马尔科夫决策过程
Intermittently Observable Markov Decision Processes
HTML
PDF
Gongpu Chen, Soung-Chang Liew
TL;DR
本文研究了在不稳定状态信息下的MDP,提出了一种基于树组织结构和值迭代算法的有限状态近似方法来寻找最优策略。
Abstract
This paper investigates
mdps
with
intermittent state information
. We consider a scenario where the controller perceives the state information of the process via an unreliable communication channel. The transmissi
→