BriefGPT.xyz
Nov, 2022
强化学习中的知识迁移无效行为学习
Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning
HTML
PDF
Leo Ardon, Alberto Pozanco, Daniel Borrajo, Sumitra Ganesh
TL;DR
该研究提出了一种系统性的方法来将先验知识引入强化学习算法中,试图通过学习无关的行为来降低样本复杂性,并通过实验证明了其可以提高算法的样本效率和转移学习能力。
Abstract
reinforcement learning
(RL) algorithms are known to scale poorly to environments with many available actions, requiring numerous samples to learn an optimal policy. The traditional approach of considering the same fixed
→