BriefGPT.xyz
Jun, 2023
多样化用户行为下排名策略的离线评估
Off-Policy Evaluation of Ranking Policies under Diverse User Behavior
HTML
PDF
Haruka Kiyohara, Masatoshi Uehara, Yusuke Narita, Nobuyuki Shimizu, Yasuo Yamamoto...
TL;DR
该研究提出了自适应 IPS(AIPS)的方法来解决IPS方法在排名设置中应用的巨大方差问题,还探讨了用户行为多样性的影响。该方法极大地提高了排名系统的 OPE 有效性。
Abstract
Ranking interfaces are everywhere in online platforms. There is thus an ever growing interest in their
off-policy evaluation
(OPE), aiming towards an accurate performance evaluation of
ranking policies
using logg
→