BriefGPT.xyz
Sep, 2024
LoopSR: 循环模拟与现实在四足机器人终身策略适应中的应用
LoopSR: Looping Sim-and-Real for Lifelong Policy Adaptation of Legged Robots
HTML
PDF
Peilin Wu, Weiji Xie, Jiahang Cao, Hang Lai, Weinan Zhang
TL;DR
本文针对四足机器人在动态环境中的策略适应问题,提出了一种名为LoopSR的终身政策适应框架。通过利用基于变压器的编码器将真实世界轨迹投影到潜在空间,并在模拟中重建真实环境,从而显著提高了数据效率,使得策略在模拟到现实的应用中表现卓越。
Abstract
Reinforcement Learning
(RL) has shown its remarkable and generalizable capability in legged locomotion through
Sim-to-Real Transfer
. However, while adaptive methods like domain randomization are expected to make
→