BriefGPT.xyz
Dec, 2023
开放式世界中的学习课程
Learning Curricula in Open-Ended Worlds
HTML
PDF
Minqi Jiang
TL;DR
该论文介绍了一种称为无监督环境设计(UED)的方法,通过自动生成无限的训练环境序列或课程以匹配或超过真实世界的复杂性,从而实现深度强化学习代理在鲜有环境示例中表现出显著改进的鲁棒性和泛化能力,这些自生成的环境课程为不断生成和掌握自主设计的额外挑战的开放式学习系统提供了有希望的路径。
Abstract
deep reinforcement learning
(RL) provides powerful methods for training optimal sequential decision-making agents. As collecting real-world interactions can entail additional costs and safety risks, the common paradigm of
→