BriefGPT.xyz
Jul, 2024
面向策略学习的文本感知扩散
Text-Aware Diffusion for Policy Learning
HTML
PDF
Calvin Luo, Mandy He, Zilai Zeng, Chen Sun
TL;DR
使用文本条件的扩散模型进行密集的无示范奖励信号计算,以从自然语言中学习零样本目标实现和持续运动行为的策略学习,并在机器人操纵任务中竞争性表现。
Abstract
Training an agent to achieve particular goals or perform desired behaviors is often accomplished through
reinforcement learning
, especially in the absence of expert demonstrations. However, supporting novel goals or behaviors through
→