A Modular Framework for Stable Deep Reinforcement Learning Control

Apr, 2023

A modular framework for stabilizing deep reinforcement learning control

Nathan P. Lawrence, Philip D. Loewen, Shuyuan Wang, Michael G. Forbes, R. Bhushan Gopaluni

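TL;DR: This paper proposes a framework for designing feedback controllers that combines the advantages of deep reinforcement learning with the stability guarantees of the Youla-Kucera parameterization, using an alternative realization of that parameterization built on a data-driven internal model. A neural network representation parameterizes a set of nonlinear stable operators, which allows seamless integration with standard deep learning libraries. The approach is demonstrated on a realistic simulation of a two-tank system.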
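To make the idea concrete, here is a minimal sketch of an internal-model loop for a stable plant. It is not the authors' implementation, and it rests on several assumptions that do not come from the paper: the plant is a hypothetical first-order discrete-time system `y[k+1] = a*y[k] + b*u[k]`, the internal model is an exact copy of that plant rather than a data-driven one, and the stable operator `q_net` is a small memoryless tanh network with random weights. The names `simulate_imc` and `q_net` are illustrative only. Because the model cancels the plant's contribution to the feedback signal, the operator only ever sees the reference minus the output disturbance, so any bounded `q_net` keeps the loop bounded.

```python
import numpy as np

def simulate_imc(q, a=0.9, b=0.1, r=1.0, steps=200, disturbance=0.0):
    """Simulate an internal-model loop around a stable first-order plant.

    q is any stable operator mapping the feedback signal to a control input.
    All numeric defaults are illustrative assumptions.
    """
    plant_state = 0.0   # state of the true plant
    model_state = 0.0   # state of the internal model (assumed exact copy)
    ys, us = [], []
    for _ in range(steps):
        y = plant_state + disturbance   # measured output with constant disturbance
        e = r - (y - model_state)       # reference minus plant/model mismatch
        u = q(e)                        # stable operator produces the input
        ys.append(y)
        us.append(u)
        plant_state = a * plant_state + b * u
        model_state = a * model_state + b * u
    return np.array(ys), np.array(us)

# Hypothetical stable operator: a memoryless one-hidden-layer tanh network.
# Its output is bounded by sum(|w2|) regardless of the weights chosen.
rng = np.random.default_rng(0)
w1 = rng.normal(size=8)
w2 = rng.normal(size=8)

def q_net(e):
    return float(w2 @ np.tanh(w1 * e))

ys, us = simulate_imc(q_net, disturbance=0.5)
u_bound = np.sum(np.abs(w2))
print("max |u|:", np.max(np.abs(us)), "bound:", u_bound)
print("max |y|:", np.max(np.abs(ys)))
```

In this sketch the weights of `q_net` could be changed freely, for example by a reinforcement learning algorithm, without affecting boundedness of the loop. That separation between stability and performance tuning is the property the parameterization is meant to provide.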

Abstract

We propose a framework for the design of feedback controllers that combines the optimization-driven and model-free advantages of deep reinforcement learning with the stability guarantees provided by using the