BriefGPT.xyz
Oct, 2023
预训练和微调生成流网络
Pre-Training and Fine-Tuning Generative Flow Networks
HTML
PDF
Ling Pan, Moksh Jain, Kanika Madan, Yoshua Bengio
TL;DR
发展了一种无监督预训练的 GFlowNets 方法,通过预训练 OC-GFN 模型,可以在下游任务中直接提取适应新奖励函数的策略,并证明了该方法在发现模式和适应下游任务方面的有效性。
Abstract
generative flow networks
(
gflownets
) are amortized samplers that learn stochastic policies to sequentially generate compositional objects from a given unnormalized reward distribution. They can generate diverse s
→