BriefGPT.xyz
Dec, 2023
Distributed Optimization via Kernelized Multi-armed Bandits
Ayush Rai, Shaoshuai Mou
TL;DR
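This paper proposes a fully decentralized algorithm, Multi-agent IGP-UCB, that combines distributed optimization with multi-armed bandit methods to minimize regret across agents, providing improved performance while preserving privacy.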
Abstract
Multi-armed bandit algorithms provide solutions for sequential decision-making where learning takes place by interacting with the environment. In this work, we model a distributed optimization problem as a multi-