BriefGPT.xyz
Feb, 2023
State-wise Safe Reinforcement Learning: A Survey
Weiye Zhao, Tairan He, Rui Chen, Tianhao Wei, Changliu Liu
TL;DR
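This paper surveys existing methods for handling state-wise constraints in reinforcement learning and compares their differences and trade-offs in safety, scalability, and reward performance. It also summarizes the limitations of current methods and discusses directions for future research.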
Abstract
Despite the tremendous success of reinforcement learning (RL) algorithms in simulation environments, applying RL to real-world applications still faces many challenges. A major concern is safety, in other words, constr
→