BriefGPT.xyz
Jan, 2025
强化学习中的视野泛化
Horizon Generalization in Reinforcement Learning
HTML
PDF
Vivek Myers, Catherine Ji, Benjamin Eysenbach
TL;DR
本研究针对目标条件下的强化学习,聚焦于视野泛化问题,而非传统的随机增强和领域随机化。我们提出了一种新颖的方法,通过学习能够适应不同目标距离的策略,实验结果表明在训练了接近的目标后,该策略能够有效地达到远距离目标,这为强化学习中的泛化和规划提供了新的视角和方法。
Abstract
We study
Goal-Conditioned
RL through the lens of
Generalization
, but not in the traditional sense of random augmentations and domain randomization. Rather, we aim to learn goal-directed policies that generalize w
→