在线排名与Top-1反馈

Oct, 2014

Online Ranking with Top-1 Feedback

Sougata Chaudhuri, Ambuj Tewari

TL;DR本研究探讨了一种在线学习算法，使用新颖的 Top-1 反馈模型，评估对多样性兴趣用户的固定排名商品排名能力，并证明了其对于几种流行的排名度量具有最小化后悔的能力。

Abstract

We consider a setting where a system learns to rank a fixed set of m items. The goal is produce a good ranking for users with diverse interests who interact with the system for T rounds in an online fashion. We consider a novel top-1 feedback model for this problem: at the end of each