BriefGPT.xyz
Feb, 2024
基于插值的策略扩散行为细化
Behavioral Refinement via Interpolant-based Policy Diffusion
HTML
PDF
Kaiqi Chen, Eugene Lim, Kelvin Lin, Yiyang Chen, Harold Soh
TL;DR
这篇论文通过使用信息源策略,提出了一种名为BRIDGER的方法,在模仿学习任务中优于现有的扩散策略,并在设计方面进行了进一步分析。
Abstract
imitation learning
empowers artificial agents to mimic behavior by learning from demonstrations. Recently,
diffusion models
, which have the ability to model high-dimensional and multimodal distributions, have sho
→