BriefGPT.xyz
Jun, 2023
非线性系统中基于模型的RL的最优探索
Optimal Exploration for Model-Based RL in Nonlinear Systems
HTML
PDF
Andrew Wagenmaker, Guanya Shi, Kevin Jamieson
TL;DR
本研究针对非线性动态系统的控制问题,提出了一种通过最小化控制器损失和在任意系统中都能达到最佳实例速率的算法,该算法通过策略优化来实现优化实验设计,从而降低系统的参数不确定性。
Abstract
Learning to control unknown
nonlinear dynamical systems
is a fundamental problem in
reinforcement learning
and control theory. A commonly applied approach is to first explore the environment (exploration), learn
→