Feb, 2024
Probabilistic Actor-Critic: Learning to Explore with PAC-Bayes Uncertainty
TL;DR: The Probabilistic Actor-Critic (PAC) algorithm improves continuous control performance in deep reinforcement learning by combining stochastic policies with stochastic critics, explicitly modeling critic uncertainty through PAC-Bayes analysis, and using that uncertainty to adapt its exploration strategy.