The Max-Min Formulation of Multi-Objective Reinforcement Learning: From Theory to a Model-Free Algorithm

Jun, 2024

Giseung Park, Woohyeon Byeon, Seongmin Kim, Elad Havakuk, Amir Leshem...

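TL;DR: This paper studies multi-objective reinforcement learning for real-world problems with multiple optimization goals. It adopts a max-min framework motivated by fairness and, within that framework, develops the supporting theory and a practical model-free algorithm. The authors present the theory as an advance for multi-objective reinforcement learning and report that the algorithm significantly outperforms existing baseline methods.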

Abstract

In this paper, we consider multi-objective reinforcement learning, which arises in many real-world problems with multiple optimization goals. We approach the problem with a max-min framework focusing on