BriefGPT.xyz
Feb, 2023
医生对口罩使用的结论:有用但需辩证看待
A Blackbox Approach to Best of Both Worlds in Bandits and Beyond
HTML
PDF
Christoph Dann, Chen-Yu Wei, Julian Zimmert
TL;DR
本研究提出了一种广义的最好结果算法以及如何通过规范化导向跟随和在线镜像下降算法实现在线学习中的最好结果,将这种算法应用于上下文、图和表马尔科夫决策过程中。
Abstract
Best-of-both-worlds algorithms for
online learning
which achieve near-optimal
regret
in both the adversarial and the stochastic regimes have received growing attention recently. Existing techniques often require
→