May, 2024
GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning
Jaewoo Lee, Sujin Yun, Taeyoung Yun, Jinkyoo Park
TL;DR
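In offline reinforcement learning, a data augmentation strategy based on Generative Trajectory Augmentation (GTA) can improve data quality and algorithm performance.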
Abstract
Offline reinforcement learning (Offline RL) presents challenges of learning effective decision-making policies from static datasets without any online interactions. Data augmentation techniques, such as noise inj…