BriefGPT.xyz
Dec, 2023
TRC:用于安全强化学习的信任区域条件风险价值
TRC: Trust Region Conditional Value at Risk for Safe Reinforcement Learning
HTML
PDF
Dohyeong Kim, Songhwai Oh
TL;DR
提出了一种以条件风险为约束的信赖区域安全强化学习方法(TRC),通过近似上界和使用次问题训练策略,实现在安全约束下达到更优性能的有效导航任务。
Abstract
As
safety
is of paramount importance in robotics,
reinforcement learning
that reflects
safety
, called
→