强化学习中的函数逼近对称性学习

Jun, 2017

强化学习中的函数逼近对称性学习

Symmetry Learning for Function Approximation in Reinforcement Learning

Anuj Mahajan, Theja Tulabandhula

TL;DR研究如何利用对称性以实现强化学习的样本效率，引入检测对称性的新方法，并证明其完整性；提供了将发现的对称性用于功能逼近的框架，并证明基于潜在效益的奖励塑形对于利用对称性特别有效；实验表明，利用对称性信息可以显著提高学习性能。

Abstract

In this paper we explore methods to exploit symmetries for ensuring sample efficiency in reinforcement learning (RL), this problem deserves ever increasing attention with the recent advances in the use of