广义线性函数逼近强化学习中的乐观主义

Dec, 2019

Optimism in Reinforcement Learning with Generalized Linear Function Approximation

Yining Wang, Ruosong Wang, Simon S. Du, Akshay Krishnamurthy

TL;DR本论文提出了一种新的基于广义线性函数逼近的回合式强化学习算法，并在乐观闭合假设下分析其性能，证明了其具有更低的复杂度，并且是强化学习中第一个具有统计和计算效率的基于广义线性函数的算法。

Abstract

We design a new provably efficient algorithm for episodic reinforcement learning with generalized linear function approximation. We analyz