BriefGPT.xyz
Mar, 2024
线性约束在线LQG问题的策略优化的遗憾分析
Regret Analysis of Policy Optimization over Submanifolds for Linearly Constrained Online LQG
HTML
PDF
Ting-Jui Chang, Shahin Shahrampour
TL;DR
在线优化方法可用于研究在线线性二次型调节器问题,本研究通过在线乐观牛顿法提供了一个基于函数序列的在线控制器,并利用后悔度量定义了算法的性能界限。
Abstract
Recent advancement in
online optimization
and control has provided novel tools to study online
linear quadratic regulator
(LQR) problems, where cost matrices are varying adversarially over time. However, the cont
→