BriefGPT.xyz
Apr, 2017
具有相关臂的多路对决自适应波段算法
Multi-dueling Bandits with Dependent Arms
HTML
PDF
Yanan Sui, Vincent Zhuang, Joel W. Burdick, Yisong Yue
TL;DR
本文研究具有相关性的多股臂的多对打算法,在推荐系统等领域可以更高效地学习和优化用户的基于偏好的关键特征,使用自对抗算法,结合高斯过程统计方法可以更准确地捕捉相关性,提升算法的效果。
Abstract
The
dueling bandits
problem is an
online learning
framework for learning from pairwise preference feedback, and is particularly well-suited for modeling settings that elicit subjective or implicit human feedback.
→