DeLF: Designing Learning Environments with Foundation Models

Jan, 2024

Aida Afshar, Wenchao Li

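TL;DR: We propose DeLF, a method that uses large language models to design and encode a user's intended learning scenario, and thereby to design the components of a reinforcement learning (RL) environment. It targets the problem that applying RL in practice remains difficult even in many simple applications. By testing the method on four different learning environments, we show that DeLF can obtain executable environment code for the corresponding RL problems.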

Abstract

Reinforcement learning (RL) offers a capable and intuitive structure for the fundamental sequential decision-making problem. Despite impressive breakthroughs, it can still be difficult to employ RL in practice in many simple applications. In this paper, we try to address this issue by