BriefGPT.xyz
Jun, 2022
同时学习具有一般图反馈的随机与对抗赌博机
Simultaneously Learning Stochastic and Adversarial Bandits with General Graph Feedback
HTML
PDF
Fang Kong, Yichi Zhou, Shuai Li
TL;DR
本文研究了在线学习中使用图形反馈的问题,提出了一种新的权衡机制,能够同时在随机环境和对抗环境取得最优结果,具有很好的推广性。
Abstract
The problem of
online learning
with
graph feedback
has been extensively studied in the literature due to its generality and potential to model various learning tasks. Existing works mainly study the adversarial a
→