Julien Perez, Denys Proux, Claude Roux, Michael Niemaz
TL;DR本研究介绍了一个基于自然语言生成奖励和目标函数的机器人操作自动化训练方法,运用Goal-conditioned 和 MTRL技术,使用Large Language Models,针对机器人操作的可扩展性问题进行了实验验证。
Abstract
goal-conditioned and multi-task reinforcement learning (GCRL and MTRL) address numerous problems related to robot learning, including locomotion, navigation, and manipulation scenarios. Recent works focusing on l