BriefGPT.xyz
Oct, 2022
Indexability is Not Enough for Whittle: Improved, Near-Optimal Algorithms for Restless Bandits
Abheek Ghosh, Dheeraj Nagaraj, Manish Jain, Milind Tambe
TL;DR
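This paper studies the planning problem for restless multi-armed bandits and proposes a planning algorithm based on a mean-field method that obtains near-optimal policies. In experiments, the algorithm performs well in practical applications and requires no external hyperparameters.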
Abstract
We study the problem of planning restless multi-armed bandits (RMABs) with multiple actions. This is a popular model for multi-agent systems with applications like multi-channel communication, monitoring and mach…