BriefGPT.xyz
Mar, 2017
反向风险敏感的强化学习
Risk-Sensitive Inverse Reinforcement Learning via Gradient Methods
HTML
PDF
Lillian J. Ratliff, Eric Mazumdar
TL;DR
本研究提出了一种基于人类决策模型的梯度下降型反向强化学习算法,用于解决马尔可夫决策过程中有风险感知的智能体反向强化学习问题,并结合两个案例进行了性能展示。
Abstract
We address the problem of
inverse reinforcement learning
in Markov decision processes where the agent is risk-sensitive. In particular, we model
risk-sensitivity
in a reinforcement learning framework by making us
→