BriefGPT.xyz
Oct, 2024
未知上下文分布的跨学习上下文赌博机的高概率界限
High Probability Bound for Cross-Learning Contextual Bandits with Unknown Context Distributions
HTML
PDF
Ruiyuan Huang, Zengfeng Huang
TL;DR
本研究解决了在上下文赌博机中,学习者面临未知上下文分布的跨学习问题。我们提出对Schneider和Zimmert(2023)算法的深入分析,证明其在高概率下也能实现接近最优的遗憾值。研究结果将促使相关领域的算法设计更加关注高概率性能,而非仅仅关注期望遗憾。
Abstract
Motivated by applications in online bidding and sleeping bandits, we examine the problem of
Contextual Bandits
with
Cross Learning
, where the learner observes the loss associated with the action across all possib
→