BriefGPT.xyz
Feb, 2024
Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL
Sungyoon Kim, Yunseon Choi, Daiki E. Matsunaga, Kee-Eung Kim
TL;DR
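The SSD model uses a conditional diffusion model to generate high-quality plans from an offline dataset, successfully stitching together suboptimal trajectory segments in the offline data and achieving state-of-the-art performance on standard GCRL benchmark tasks.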
Abstract
Offline Goal-Conditioned Reinforcement Learning (Offline GCRL) is an important problem in RL that focuses on acquiring diverse goal-oriented skills solely from pre-collected behavior datasets. In this setting, th