BriefGPT.xyz
Feb, 2021
个性化联邦多臂老虎机
Federated Multi-armed Bandits with Personalization
HTML
PDF
Chengshuai Shi, Cong Shen, Jing Yang
TL;DR
提出了个性化联邦多臂老虎机(PF-MAB)的总体框架,研究了一个灵活平衡泛化和个性化的混合老虎机学习问题,并提出了个性化联邦上置信上界(PF-UCB)算法,在理论分析和实验方面都取得了良好效果。
Abstract
A general framework of
personalized federated multi-armed bandits
(PF-MAB) is proposed, which is a new bandit paradigm analogous to the
federated learning
(FL) framework in supervised learning and enjoys the feat
→