BriefGPT.xyz
Jun, 2019
不完整信息下随机赌博机的内在鲁棒性对策略操纵
The Intrinsic Robustness of Stochastic Bandits to Strategic Manipulation
HTML
PDF
Zhe Feng, David C. Parkes, Haifeng Xu
TL;DR
研究了在自利的情况下,三种常见的赌博算法UCB, ε-Greedy和Thompson Sampling 对策略行为的适应性,为应用于经济学中的推荐系统提供了鲁棒的工具。
Abstract
We study the behavior of
stochastic bandits algorithms
under \emph{
strategic behavior
} conducted by rational actors, i.e., the arms. Each arm is a strategic player who can modify its own reward whenever pulled, s
→