Logic-based Reward Shaping for Multi-Agent Reinforcement Learning

Jun, 2022

Ingy ElSayed-Aly, Lu Feng

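TL;DR: This work studies the problem of reward design in logic-based multi-agent reinforcement learning and proposes a scalable, semi-centralized logic-based reward shaping method that copes with a growing number of agents in the task.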

Abstract

Reinforcement learning (RL) relies heavily on exploration to learn from its environment and maximize observed rewards. Therefore, it is essential to design a reward function that guarantees optimal learning from the received experience. Previous work has combined →