BriefGPT.xyz
Mar, 2024
离线技能扩散的稳健策略学习
Robust Policy Learning via Offline Skill Diffusion
HTML
PDF
Woo Kyung Kim, Minjong Yoo, Honguk Woo
TL;DR
通过离线数据集学习的、能够在不同领域中应用的多功能技能是一项全新的离线技能学习框架 DuSkill 的核心,通过引导式扩散模型生成可以应用于任务的多功能技能,从而增加不同领域中策略学习的稳健性。
Abstract
skill-based reinforcement learning
(RL) approaches have shown considerable promise, especially in solving long-horizon tasks via
hierarchical structures
. These skills, learned task-agnostically from offline datas
→