基于 GAN 的内在探索，用于高效样本的强化学习

Jun, 2022

GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning

Doğay Kamar, Nazım Kemal Üre, Gözde Ünal

TL;DR在这篇研究中，我们提出了基于生成对抗网络的内在奖励模块来解决强化学习中有效探索的问题，该模块学习所观察到状态的分布并发送内在奖励以引导智能体探索未被探索过的状态，我们通过对无奖励和稀疏奖励的游戏场景实验表明了该方法的有效性。

Abstract

In this study, we address the problem of efficient exploration in reinforcement learning. Most common exploration approaches depend on ran