Learning Dynamic Abstract Representations for Sample-Efficient Reinforcement Learning

October 2022

Learning Dynamic Abstract Representations for Sample-Efficient Reinforcement Learning

Mehdi Dadvar, Rashmeet Kaur Nayyar, Siddharth Srivastava

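TL;DR: This paper introduces a novel top-down approach for constructing state abstractions while performing reinforcement learning. The approach dynamically computes an abstraction based on the dispersion of Q-values. Results show that it automatically learns abstractions finely tuned to the problem, is highly sample-efficient, and enables the reinforcement learning agent to significantly outperform existing approaches.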
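
To make the idea of refining an abstraction where Q-values are widely dispersed more concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' algorithm: the 1-D interval abstraction, the function name `refine`, the midpoint split, and the `threshold` parameter are all hypothetical, and the excerpt here does not specify the paper's actual dispersion measure or data structure.

```python
from statistics import pstdev


def refine(intervals, q_samples, threshold):
    """Split every abstract state whose observed Q-values are widely dispersed.

    intervals: list of (low, high) half-open intervals over a 1-D state variable
               (a hypothetical stand-in for abstract states)
    q_samples: list of (state, q_value) pairs observed during learning
    threshold: hypothetical cutoff on the population standard deviation
    """
    refined = []
    for low, high in intervals:
        # Collect the Q-values of concrete states that fall in this abstract state.
        values = [q for s, q in q_samples if low <= s < high]
        if len(values) >= 2 and pstdev(values) > threshold:
            # High dispersion: the abstract state lumps together states that
            # behave differently, so split it at the midpoint (an assumption).
            mid = (low + high) / 2
            refined.extend([(low, mid), (mid, high)])
        else:
            # Low dispersion or too little data: keep the coarse abstract state.
            refined.append((low, high))
    return refined


# Start from one coarse abstract state covering the whole state range.
samples = [(0.1, 0.0), (0.2, 0.1), (0.8, 1.0), (0.9, 0.9)]
once = refine([(0.0, 1.0)], samples, threshold=0.2)
twice = refine(once, samples, threshold=0.2)
```

In this toy run the single coarse interval is split once, because its Q-values differ sharply between the left and right halves, and the second pass leaves both halves alone because each is internally consistent. A top-down scheme of this kind starts coarse and only spends resolution where the value estimates disagree.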

Abstract

In many real-world problems, the learning agent needs to learn a problem's abstractions and solution simultaneously. However, most such abstractions need to be designed and refined by hand for different problems and domains of application. This paper presents a novel top-down approach