BriefGPT.xyz
Nov, 2024
Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement Learning
Yuanlin Duan, Guofeng Cui, He Zhu
TL;DR
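This work addresses the challenge of exploring unknown environments efficiently in unsupervised goal-conditioned reinforcement learning and proposes a new goal-directed exploration algorithm, Cluster Edge Exploration ($CE^2$). The method uses clustering to select goal states that lie in sparsely explored regions yet remain reachable, which significantly improves robot exploration efficiency in complex environments and outperforms baseline methods.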
Abstract
Exploring unknown environments efficiently is a fundamental challenge in unsupervised Goal-Conditioned Reinforcement Learning. While selecting exploratory goals at the frontier of previously explored states is an