BriefGPT.xyz
Sep, 2023
逆强化学习:从示范中推导与适应双足行走奖励学习
Infer and Adapt: Bipedal Locomotion Reward Learning from Demonstrations via Inverse Reinforcement Learning
HTML
PDF
Feiyang Wu, Zhaoyuan Gu, Hanran Wu, Anqi Wu, Ye Zhao
TL;DR
通过逆强化学习(IRL)技术解决复杂地形上的双足机器人行走问题,并提出用于学习专家奖励函数的算法,通过非线性函数逼近揭示专家的运动策略,同时通过训练推断奖励函数,提高双足机器人在未知地形上的行走性能。
Abstract
Enabling
bipedal walking robots
to
learn
how to
maneuver
over highly uneven, dynamically changing terrains is challenging due to the compl
→