BriefGPT.xyz
Aug, 2020
合作多智能体情景赌博的核方法
Kernel Methods for Cooperative Multi-Agent Contextual Bandits
HTML
PDF
Abhimanyu Dubey, Alex Pentland
TL;DR
本文研究了合作多智能体决策问题中的基于核的上下文平衡问题,提出了 Coop-KernelUCB 算法并在多个实验中验证其表现优于现有基准算法。
Abstract
cooperative multi-agent decision making
involves a group of agents cooperatively solving learning problems while communicating over a network with delays. In this paper, we consider the
kernelised contextual bandit prob
→