TL;DR本文研究了恶意代理如何在 linear contextual bandit algorithm 上执行攻击,并提出了一种有效的算法来执行此攻击。
Abstract
contextual bandit algorithms are applied in a wide range of domains, from advertising to recommender systems, from clinical trials to education. In many of these domains, malicious agents may have incentives to a