BriefGPT.xyz
Jul, 2023
On the Robustness of Epoch-Greedy in Multi-Agent Contextual Bandit Mechanisms
Yinglun Xu, Bhuvesh Kumar, Jacob Abernethy
TL;DR
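This work studies how to learn efficiently in multi-armed bandit mechanisms such as PPC auctions, addressing three challenges: inducing truthful bidding behavior (incentives), user personalization (context), and manipulation of click patterns (corruptions). It proposes a contextual multi-armed bandit algorithm that performs well in the presence of both context and corruptions.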
Abstract
Efficient learning in multi-armed bandit mechanisms such as pay-per-click (PPC) auctions typically involves three challenges: 1) inducing truthful bidding behavior (incentives), 2) using personalization in the us