Offline reinforcement learning has emerged as a promising technology whose practicality comes from learning on large pre-collected datasets. Despite these practical benefits, most algorithm development research in offline reinforcement learning still relies on game tasks with synthetic datasets. To address this limitation, this paper provides autonomous driving datasets and benchmarks for offline reinforcement learning research. We provide 19 datasets, including real-world human driver datasets, along with seven popular offline reinforcement learning algorithms, in three realistic driving scenarios. We also provide a unified decision-making process model that can operate effectively across different scenarios, serving as a reference framework for algorithm design. Our research lays the groundwork for further collaboration in the community to explore practical aspects of existing reinforcement learning methods. Datasets and code are available at https://sites.google.com/view/ad4rl.

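This work provides autonomous driving datasets and benchmarks for offline reinforcement learning algorithms. It includes 19 datasets, among them real-world human driver datasets, and seven popular offline reinforcement learning algorithms in three realistic driving scenarios. It also provides a unified decision-making process model as a reference framework for algorithm design, laying the groundwork for exploring practical aspects of existing reinforcement learning methods.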

AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning with Value-Based Datasets