BriefGPT.xyz
Aug, 2023
基于随机奖励稳定化的模型无关强化学习在推荐系统中的应用
Model-free Reinforcement Learning with Stochastic Reward Stabilization for Recommender Systems
HTML
PDF
Tianchi Cai, Shenliao Bao, Jiyan Jiang, Shiji Zhou, Wenpeng Zhang...
TL;DR
基于无模型的强化学习推荐系统,通过引入两种随机奖励稳定化框架以替换直接的随机反馈,成功应对了用户在不同时间对同一项的随机反馈问题。
Abstract
model-free rl-based recommender systems
have recently received increasing research attention due to their capability to handle
partial feedback
and
→