BriefGPT.xyz
Nov, 2023
面向能适应非结构化数据的无模型强化学习算法的发展
Towards model-free RL algorithms that scale well with unstructured data
HTML
PDF
Joseph Modayil, Zaheer Abbas
TL;DR
强化学习算法在尺度递增和非结构化观测方面表现良好的方法,能够有效利用外部知识构建预测结构,并提供环境和算法供研究无结构观测向量和平面动作空间的缩放问题。
Abstract
Conventional
reinforcement learning
(RL) algorithms exhibit broad generality in their theoretical formulation and high performance on several challenging domains when combined with powerful
function approximation
→