将离线强化学习重新构建为回归问题

Jan, 2024

Reframing Offline Reinforcement Learning as a Regression Problem

Prajwal Koirala, Cody Fleming

TL;DR该研究将离线强化学习重新定义为一个可以用决策树解决的回归问题，通过梯度提升树可以实现快速训练和推理，同时对通用性进行了讨论。

Abstract

The study proposes the reformulation of offline reinforcement learning as a regression problem that can be solved with decision trees. Aim