BriefGPT.xyz
Nov, 2023
基于学习的线性二次高斯控制附加勘探的遗憾分析
Regret Analysis of Learning-Based Linear Quadratic Gaussian Control with Additive Exploration
HTML
PDF
Archith Athrey, Othmane Mazhar, Meichen Guo, Bart De Schutter, Shengling Shi
TL;DR
通过数值模拟,我们分析了一种称为LQG-NAIVE的方法在控制未知部分可观察系统时产生的后悔效应,提出了扩展到“闭环”设置的LQG-IF2E方法,并验证了其与LQG-NAIVE的竞争性能。
Abstract
In this paper, we analyze the
regret
incurred by a computationally efficient exploration strategy, known as
naive exploration
, for controlling unknown partially observable systems within the Linear Quadratic Gaus
→