BriefGPT.xyz
May, 2023
利用点击反馈对在线学习排序进行对抗攻击
Adversarial Attacks on Online Learning to Rank with Click Feedback
HTML
PDF
Jinhang Zuo, Zhiyao Zhang, Zhiyong Wang, Shuai Li, Mohammad Hajiesmaili...
TL;DR
本文研究了攻击多个OLTR变体的策略,并提出了一般的攻击策略来攻击任何算法,在合成数据和真实数据上的实验验证了我们提出的攻击算法的有效性。
Abstract
online learning to rank
(OLTR) is a sequential decision-making problem where a learning agent selects an ordered list of items and receives feedback through user clicks. Although potential attacks against
oltr algorithm
→