Feb 2024
Probabilistic Actor-Critic: Learning to Explore with PAC-Bayes Uncertainty
Bahareh Tasdighi, Nicklas Werge, Yi-Shan Wu, Melih Kandemir
TL;DR: The Probabilistic Actor-Critic (PAC) algorithm improves continuous control performance in deep reinforcement learning by combining stochastic policies with stochastic critics, explicitly modeling critic uncertainty through PAC-Bayes analysis, and using that uncertainty to adapt its exploration strategy.