BriefGPT.xyz
Jan, 2024
统一的不确定性感知探索:结合认知和随机不确定性
A unified uncertainty-aware exploration: Combining epistemic and aleatory uncertainty
HTML
PDF
Parvin Malekzadeh, Ming Hou, Konstantinos N. Plataniotis
TL;DR
我们提出了一种基于分布式强化学习的算法,通过估计参数化回报分布来统一估计aleatory和epistemic不确定性,并量化两种不确定性的综合效应以实现风险敏感的勘探。实证结果表明,我们的方法在具有勘探和风险挑战的任务中优于替代方法。
Abstract
Exploration is a significant challenge in practical
reinforcement learning
(RL), and
uncertainty-aware exploration
that incorporates the quantification of epistemic and
→