针对对手感知的去中心化网络多智能体强化学习算法

May, 2023

针对对手感知的去中心化网络多智能体强化学习算法

An Algorithm For Adversary Aware Decentralized Networked MARL

Soumajyoti Sarkar

TL;DR研究了去中心化的多智能体强化学习算法，引入了对抗性智能体对共识更新的漏洞，并提出了一种算法，使得非对抗性智能体在受限制的情况下达成共识。

Abstract

decentralized multi-agent reinforcement learning (MARL) algorithms have become popular in the literature since it allows heterogeneous agents to have their own reward functions as opposed to canonical multi-agent Markov Decision Process (MDP) settings which assume common reward functio