BriefGPT.xyz
Oct, 2022
在线凸优化算法(无内存限制)
Online Convex Optimization with Unbounded Memory
HTML
PDF
Raunak Kumar, Sarah Dean, Robert D. Kleinberg
TL;DR
本文提出了一个新的在线凸优化框架,能够利用过去的决策历史对当前损失进行建模,并引入了“p有效内存容量”来量化过去决策对当前损失的最大影响。在此框架下,证明了一些政策遗憾的较好上界,并展示了该框架对于各种在线学习任务的适用性。
Abstract
online convex optimization
(OCO) is a widely used framework in
online learning
. In each round, the learner chooses a decision in some convex set and an adversary chooses a convex loss function, and then the learn
→