BriefGPT.xyz
Apr, 2023
通过奖励引导探索实现可控扩散模型
Towards Controllable Diffusion Models via Reward-Guided Exploration
HTML
PDF
Hengtong Zhang, Tingyang Xu
TL;DR
本文提出了一种名为RGDM的模型,通过强化学习(RL)引导扩散模型的训练阶段,从而实现对样本生成的控制。在3D形状和分子生成任务上的实验表明,该模型相较于现有的条件扩散模型具有显著的改进。
Abstract
By formulating data samples' formation as a
markov denoising process
,
diffusion models
achieve state-of-the-art performances in a collection of tasks. Recently, many variants of
→