BriefGPT.xyz
Nov, 2020
反向受限强化学习
Inverse Constrained Reinforcement Learning
HTML
PDF
Usman Anwar, Shehryar Malik, Alireza Aghasi, Ali Ahmed
TL;DR
研究使用强化学习智能体从行为演示中学习约束以及将其迁移到具有不同形态和奖励功能的新智能体的方法,建立了能够在高维度完全无模型的情况下学习任意 Markovian 约束的框架,并且该方法与之前的工作相比,在离散设置、特定类型约束和环境转移动力学等方面表现更好。
Abstract
Standard
reinforcement learning
(RL) algorithms train agents to maximize given reward functions. However, many real-world applications of RL require agents to also satisfy certain
constraints
which may, for examp
→