BriefGPT.xyz
Oct, 2023
偷袭计划对抗不完美观察者
Covert Planning against Imperfect Observers
HTML
PDF
Haoxiang Ma, Chongyang Shi, Shuo Han, Michael R. Dorothy, Jie Fu
TL;DR
隐秘规划研究使用随机动力学和不完美观察来实现最佳任务表现而不被检测到,本文引入了马尔可夫决策过程和近端策略梯度方法来解决这个问题。
Abstract
covert planning
refers to a class of constrained planning problems where an agent aims to accomplish a task with minimal information leaked to a passive observer to avoid detection. However, existing methods of
covert p
→