BriefGPT.xyz
May, 2018
一种基于李亚普诺夫函数的安全强化学习方法
A Lyapunov-based Approach to Safe Reinforcement Learning
HTML
PDF
Yinlam Chow, Ofir Nachum, Edgar Duenez-Guzman, Mohammad Ghavamzadeh
TL;DR
提出了一种基于Lyapunov方法的安全强化学习算法,该算法可在保证行为策略安全的前提下,有效地平衡约束满足和性能优化。
Abstract
In many real-world
reinforcement learning
(RL) problems, besides optimizing the main objective function, an agent must concurrently avoid violating a number of
constraints
. In particular, besides optimizing perfo
→