BriefGPT.xyz
Apr, 2023
风险敏感和鲁棒的基于模型的强化学习和规划
Risk-Sensitive and Robust Model-Based Reinforcement Learning and Planning
HTML
PDF
Marc Rigter
TL;DR
本研究主要关注序列决策算法中的不确定性和风险问题,通过探索规划和强化学习两种方法,尤其是面向基于模型算法的研究,旨在缓解epistemic和aleatoric不确定性问题。
Abstract
Many
sequential decision-making
problems that are currently automated, such as those in manufacturing or recommender systems, operate in an environment where there is either little
uncertainty
, or zero
→