BriefGPT.xyz
Nov, 2018
Contingency-Aware Exploration in Reinforcement Learning
Jongwook Choi, Yijie Guo, Marcin Moczulski, Junhyuk Oh, Neal Wu...
TL;DR
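This paper investigates whether learning the controllable aspects of an environment, together with contingency-awareness, can lead to better exploration in reinforcement learning, and reports experiments on this question. The results show that combining our state representation with an actor-critic algorithm and count-based exploration achieves strong performance.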
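As a rough illustration of the count-based exploration mentioned in the TL;DR, the sketch below adds a bonus that shrinks as a state key is visited more often. It is not the authors' implementation: the class `CountBonus`, the function `shaped_reward`, the coefficient `beta`, the bonus form `beta / sqrt(N)`, and the tuple-shaped key are all assumptions chosen for illustration. In practice the key would be some hashable, discretized value derived from the learned state representation.

```python
import math
from collections import defaultdict


class CountBonus:
    """Hypothetical count-based exploration bonus: beta / sqrt(N(key))."""

    def __init__(self, beta=0.1):
        self.beta = beta                  # assumed bonus scale, not from the source
        self.counts = defaultdict(int)    # visit count per state key

    def bonus(self, key):
        # Record the visit first, then compute the bonus from the updated count.
        self.counts[key] += 1
        return self.beta / math.sqrt(self.counts[key])


def shaped_reward(extrinsic, key, counter):
    # Reward passed to the learner (e.g. an actor-critic agent):
    # environment reward plus the exploration bonus for this key.
    return extrinsic + counter.bonus(key)
```

A rarely visited key yields a larger bonus than a frequently visited one, so the agent is nudged toward states it has seen less often.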
Abstract
This paper investigates whether learning contingency-awareness and controllable aspects of an environment can lead to better exploration in reinforcement learning...