BriefGPT.xyz
May, 2022
CAMEO: 基於好奇心擴增的都市探索最優策略
CAMEO: Curiosity Augmented Metropolis for Exploratory Optimal Policies
HTML
PDF
Mohamed Alami Chehboune, Fernando Llorente, Rim Kaddah, Luca Martino, Jesse Read
TL;DR
本文介绍了一种基于好奇心的Metropolis算法(CAMEO),用于采样解决环境控制问题的多个最优策略,使其展现出不同的行为和风险属性,为实用和可解释性应用提供基础,也为学习多个最优策略的分布打下了第一步基础。
Abstract
reinforcement learning
has drawn huge interest as a tool for solving optimal control problems. Solving a given problem (task or environment) involves converging towards an optimal policy. However, there might exist multiple
→