BriefGPT.xyz
Feb, 2013
具有切换成本和其他自适应对手的在线学习
Online Learning with Switching Costs and Other Adaptive Adversaries
HTML
PDF
Nicolo Cesa-Bianchi, Ofer Dekel, Ohad Shamir
TL;DR
本文研究了预测中的不同类型自适应(非固定的)对手的强度,使用新概念的策略遗憾去衡量玩家的表现,特别关注记忆和切换成本的自适应对手,具有均摊2/3次幂的速率且强度显著较弱。
Abstract
We study the power of different types of adaptive (nonoblivious) adversaries in the setting of
prediction
with expert advice, under both full information and bandit feedback. We measure the player's performance using a new notion of
→