BriefGPT.xyz
Jun, 2023
自适应集成 Q-学习: 通过误差反馈减小估计偏差
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback
HTML
PDF
Hang Wang, Sen Lin, Junshan Zhang
TL;DR
通过测试发现Adaptive Ensemble Q-learning(AdaEQ)集成模型在MuJoCo基准测试中能够提高学习性能,该模型结合了模型识别自适应控制(MIAC)来实现有效的集成尺寸自适应,并通过逼近误差表征来灵活控制集成尺寸。
Abstract
The
ensemble method
is a promising way to mitigate the overestimation issue in
q-learning
, where multiple
function approximators
are used
→