BriefGPT.xyz
Oct, 2023
安全体育场:统一的安全强化学习基准
Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
HTML
PDF
Jiaming Ji, Borong Zhang, Jiayi Zhou, Xuehai Pan, Weidong Huang...
TL;DR
这篇论文介绍了一个名为Safety-Gymnasium的环境套件和一个名为Safe Policy Optimization的算法库,其中包含了16种最先进的安全强化学习算法,旨在促进安全性能的评估和比较,并推动强化学习在更安全、更可靠和负责任的实际应用中的发展。
Abstract
artificial intelligence
(AI) systems possess significant potential to drive societal progress. However, their deployment often faces obstacles due to substantial safety concerns.
safe reinforcement learning
(Safe
→