BriefGPT.xyz
Aug, 2018
Approximate Exploration through State Abstraction
Adrien Ali Taïga, Aaron Courville, Marc G. Bellemare
TL;DR
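This paper studies the interplay between exploration and approximation in reinforcement learning. It proposes a density-modelling-based approach to improve exploration, examines how pseudo-count bonuses are used within that approach, finds that they can lead to over- or under-exploration, and proposes a new pseudo-count bonus to mitigate these problems.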
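As a rough illustration of how a count-based bonus behaves once states are grouped by an abstraction, the sketch below keeps visit counts per abstract state and returns a bonus that shrinks as the count grows. It is not the bonus proposed in the paper: the class name `AbstractCountBonus`, the `abstraction` function, the scale `beta`, and the `beta / sqrt(n + 1)` form are all assumptions chosen for illustration.

```python
import math
from collections import Counter


class AbstractCountBonus:
    """Illustrative count-based exploration bonus over abstract states.

    Hypothetical sketch: not the bonus defined in the paper.
    """

    def __init__(self, abstraction, beta=0.1):
        # abstraction: assumed function mapping a state to an abstract state
        self.abstraction = abstraction
        # beta: assumed scale of the bonus
        self.beta = beta
        # visit counts, keyed by abstract state
        self.counts = Counter()

    def update(self, state):
        """Record one visit to the abstract state containing `state`."""
        self.counts[self.abstraction(state)] += 1

    def bonus(self, state):
        """Return a bonus that decays with the abstract visit count."""
        n = self.counts[self.abstraction(state)]
        # +1 keeps the bonus finite for abstract states never visited
        return self.beta / math.sqrt(n + 1)


# States 0..9 are grouped into two abstract states: {0..4} and {5..9}.
bonus_fn = AbstractCountBonus(abstraction=lambda s: s // 5, beta=0.1)
for _ in range(3):
    bonus_fn.update(0)

# State 4 was never visited, yet it shares the count of state 0.
print(bonus_fn.bonus(0), bonus_fn.bonus(4), bonus_fn.bonus(7))
```

In this toy setup the unvisited state 4 receives the same reduced bonus as the visited state 0 because both fall in the same abstract state, which is one way a coarse abstraction could lead to under-exploration of the kind the summary mentions.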
Abstract
Although exploration in reinforcement learning is well understood from a theoretical point of view, provably correct methods remain impractical. In this paper we study the interplay between …