multi-objective reinforcement learning (MORL) is the generalization of
standard reinforcement learning (RL) approaches to solve sequential decision
making problems that consist of several, possibly conflicting, objectives.
Generally, in such formulations, there is no single optimal pol