BriefGPT.xyz
Jun, 2023
离线强化学习的时态条件引导指导下的指导扩散器
Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning
HTML
PDF
Jifeng Hu, Yanchao Sun, Sili Huang, SiYuan Guo, Hechang Chen...
TL;DR
本文提出了一种基于时间条件的扩散模型(Temporally-Composable Diffuser),该模型可以从交互序列中提取时间信息,并将其用于指导生成,以在离线强化学习任务中实现更好的性能。
Abstract
Recent works have shown the potential of
diffusion models
in computer vision and natural language processing. Apart from the classical supervised learning fields,
diffusion models
have also shown strong competiti
→