BriefGPT.xyz
Jan, 2024
基于奖励相关性过滤的线性离线强化学习
Reward-Relevance-Filtered Linear Offline Reinforcement Learning
HTML
PDF
Angela Zhou
TL;DR
这篇论文研究了离线强化学习中带有判决论但非估计稀疏性的线性函数逼近。
Abstract
This paper studies
offline reinforcement learning
with
linear function approximation
in a setting with decision-theoretic, but not estimation sparsity. The
→