BriefGPT.xyz
May, 2019
关于可重参数化强化学习中的泛化差距
On the Generalization Gap in Reparameterizable Reinforcement Learning
HTML
PDF
Huan Wang, Stephan Zheng, Caiming Xiong, Richard Socher
TL;DR
研究重点在于利用再参数化技巧解决强化学习的泛化问题,并利用监督学习和迁移学习理论分析其推广能力,结果证明推广能力与环境转移、回报和策略函数类等因素有关。
Abstract
Understanding
generalization
in
reinforcement learning
(RL) is a significant challenge, as many common assumptions of traditional
supervised lear
→