Ali Jadbabaie, Alexander Rakhlin, Shahin Shahrampour, Karthik Sridharan
TL;DR本文提出了一种完全自适应的方法,适用于在线学习中的动态比较基准,并且应用到了零和博弈中。
Abstract
Recent literature on online learning has focused on developing adaptive algorithms that take advantage of a regularity of the sequence of observations, yet retain worst-case performance guarantees. A complementary direction is to develop prediction methods that perform well against com