BriefGPT.xyz
Jun, 2024
Explore-Go: Leveraging Exploration for Generalisation in Deep Reinforcement Learning
Max Weltevrede, Felix Kaubek, Matthijs T. J. Spaan, Wendelin Böhmer
TL;DR
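Proposes a new method, Explore-Go, which increases the number of states the agent trains on, effectively broadening the agent's starting-state distribution, in order to improve generalisation in reinforcement learning.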
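To make the idea of a broadened starting-state distribution concrete, here is a minimal sketch in Python. It is not the authors' implementation: the toy `ChainEnv`, the helper names, the use of uniformly random actions for the exploration phase, and the choice to reset when exploration reaches a terminal state are all assumptions made for illustration. The summary above only states that the method increases the number of states the agent trains on.

```python
import random


class ChainEnv:
    """Hypothetical toy environment: a 1-D chain of n cells, goal at the right end."""

    def __init__(self, n=10):
        self.n = n
        self.pos = 0

    def reset(self):
        self.pos = 0  # the single original start state
        return self.pos

    def step(self, action):
        # action is -1 (left) or +1 (right); the position is clipped to the chain
        self.pos = max(0, min(self.n - 1, self.pos + action))
        done = self.pos == self.n - 1
        return self.pos, (1.0 if done else 0.0), done


def explore_then_start(env, max_explore_steps, rng):
    """Reset, then take a random number of random actions before training begins.

    Returns the state from which the training part of the episode would start.
    """
    state = env.reset()
    k = rng.randint(0, max_explore_steps)  # inclusive on both ends
    for _ in range(k):
        state, _, done = env.step(rng.choice((-1, 1)))
        if done:
            # Assumption: if exploration ends the episode, start over.
            state = env.reset()
    return state


def start_states(env, max_explore_steps, episodes=500, seed=0):
    """Collect the set of distinct states from which training episodes would begin."""
    rng = random.Random(seed)
    return {explore_then_start(env, max_explore_steps, rng) for _ in range(episodes)}
```

With `max_explore_steps=0` every episode begins in the original start state; with a positive budget, training episodes begin from a wider set of states, which is the effect the TL;DR describes.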
Abstract
One of the remaining challenges in reinforcement learning is to develop agents that can generalise to novel scenarios they might encounter once deployed. This challenge is often framed in a multi-task setting whe