批量强化学习中的信息论考虑

May, 2019

Information-Theoretic Considerations in Batch Reinforcement Learning

Jinglin Chen, Nan Jiang

TL;DR本文探讨了在批处理模式下操作的值函数逼近方法，在有限样本和保证的前提下，分析了分布变化和强表示条件等假设的必需性和自然性，并提供了相关的理论结果。

Abstract

value-function approximation methods that operate in batch mode have foundational importance to reinforcement learning (RL). Finite sample guarantees for these methods often crucially rely on two types of assumpt