Oct, 2020
Learning the Linear Quadratic Regulator from Nonlinear Observations
Zakaria Mhammedi, Dylan J. Foster, Max Simchowitz, Dipendra Misra, Wen Sun...
TL;DR
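This work introduces a new problem setting for continuous control, called RichLQR, in which a low-dimensional continuous latent state is paired with high-dimensional nonlinear observations, with the aim of sample-efficient learning. It also develops a new algorithm, RichID, which requires no knowledge of the encoder's specific form and achieves near-optimal control using only least-squares regression for prediction.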
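As a rough sketch of the setting summarized above, written in standard LQR notation: the symbols $A$, $B$, $Q$, $R$, $w_t$, $q$, $d_x$, and $d_y$ are assumptions introduced here for illustration and do not appear in the summary, so the paper's exact formulation may differ.

```latex
\begin{aligned}
x_{t+1} &= A x_t + B u_t + w_t, && x_t \in \mathbb{R}^{d_x} \quad \text{(low-dimensional latent state, linear dynamics)} \\
y_t &\sim q(\cdot \mid x_t), && y_t \in \mathbb{R}^{d_y},\ d_y \gg d_x \quad \text{(high-dimensional nonlinear observation)} \\
c(x_t, u_t) &= x_t^\top Q x_t + u_t^\top R u_t && \text{(quadratic cost)}
\end{aligned}
```

Under this reading, the learner sees only $y_t$, never $x_t$, and is not given the encoder that maps an observation back to the latent state.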
Abstract
We introduce a new problem setting for continuous control called the LQR with rich observations, or RichLQR. In our setting, the environme