BriefGPT.xyz
Oct, 2022
CEIP: 结合显式和隐式先验知识的强化学习
CEIP: Combining Explicit and Implicit Priors for Reinforcement Learning with Demonstrations
HTML
PDF
Kai Yan, Alexander G. Schwing, Yu-Xiong Wang
TL;DR
本文提出一种名为CEIP的方法,通过使用多个并行的流来形成一个单一复杂的先验分布,并使用有效的显式检索和推进机制来改善强化学习中通过利用内在和外在先验信息来解决稀疏奖励的问题。
Abstract
Although
reinforcement learning
has found widespread use in dense reward settings, training autonomous agents with
sparse rewards
remains challenging. To address this difficulty, prior work has shown promising re
→