BriefGPT.xyz
Mar, 2025
通过对比表征学习增强安全强化学习中的探索
Enhance Exploration in Safe Reinforcement Learning with Contrastive Representation Learning
HTML
PDF
Duc Kien Doan, Bang Giang Le, Viet Cuong Ta
TL;DR
本研究解决了安全强化学习中探索与安全约束之间的平衡问题,提出了一种高效的状态表征学习方法,以应对稀疏奖励环境中的不充分探索。通过使用自编码器映射输入图像到隐层表示,并采用对比学习目标,研究显示该方法在保证安全性的同时,显著提高了探索效率。
Abstract
In
Safe Reinforcement Learning
, agent needs to balance between
Exploration
actions and safety constraints. Following this paradigm, domain transfer approaches learn a prior Q-function from the related environment
→