BriefGPT.xyz
Nov, 2022
Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning
Tu Trinh, Daniel S. Brown
TL;DR
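This paper proposes a self-assessment method based on Bayesian inverse reinforcement learning and value-at-risk. It lets an agent that learns from demonstrations compute high-confidence bounds on its performance and use those bounds to decide when it has received a sufficient number of demonstrations.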
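The idea of turning posterior samples into a high-confidence bound can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that a Bayesian inverse reinforcement learning step has already produced a list of sampled policy losses (one per posterior reward sample), and the names `value_at_risk`, `demonstrations_sufficient`, `alpha`, and `threshold` are hypothetical choices made for illustration.

```python
import math


def value_at_risk(losses, alpha=0.95):
    """Return the empirical alpha-quantile (value-at-risk) of sampled losses.

    `losses` is assumed to hold one policy-loss value per posterior reward
    sample; how those samples are produced is outside this sketch.
    """
    if not losses:
        raise ValueError("need at least one loss sample")
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    ordered = sorted(losses)
    # Smallest sample such that at least a fraction alpha of samples are <= it.
    k = max(0, math.ceil(alpha * len(ordered)) - 1)
    return ordered[k]


def demonstrations_sufficient(losses, alpha=0.95, threshold=0.1):
    """Hypothetical stopping rule: stop when the alpha-VaR loss bound is small enough."""
    return value_at_risk(losses, alpha) <= threshold


if __name__ == "__main__":
    sampled_losses = [0.01, 0.02, 0.03, 0.5]  # made-up values for illustration
    print(value_at_risk(sampled_losses, alpha=0.5))
    print(demonstrations_sufficient(sampled_losses, alpha=1.0, threshold=0.1))
```

Under these assumptions, a larger `alpha` gives a more conservative bound, so the agent would ask for more demonstrations before declaring that it has enough.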
Abstract
In this paper we examine the problem of determining demonstration sufficiency for AI agents that learn from demonstrations: how can an AI agent self-assess whether it has received enough demonstrations from an ex