BriefGPT.xyz
Jan, 2022
通过学习特征诱导奖励学习中的结构
Inducing Structure in Reward Learning by Learning Features
HTML
PDF
Andreea Bobu, Marius Wiggert, Claire Tomlin, Anca D. Dragan
TL;DR
本研究探究了奖励学习在机器人自适应行为学习中的应用,结合人类输入实现对特征的分步学习,并应用于机器人操作中。该方法在提高奖励学习效率和推广性方面优于传统的奖励学习方法。
Abstract
reward learning
enables robots to learn adaptable behaviors from
human input
. Traditional methods model the reward as a linear function of hand-crafted features, but that requires specifying all the relevant feat
→