多拷贝强化学习代理

Sep, 2023

Multicopy Reinforcement Learning Agents

Alicia P. Wolfe, Oliver Diamond, Remi Feuerman, Magdalena Kisielinska, Brigitte Goeler-Slough...

TL;DR该论文研究了一种新型的多智能体问题，其中一个智能体通过复制自身来更好或更高效地完成单一智能体任务。我们提出了一种学习算法，用于解决多重复制问题，它利用价值函数的结构有效地学习如何平衡添加额外复制的优势和成本。

Abstract

This paper examines a novel type of multi-agent problem, in which an agent makes multiple identical copies of itself in order to achieve a single agent task better or more efficiently. This strategy improves performance if the environment is noisy and the task is sometimes unachievable