BriefGPT.xyz
Sep, 2023
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning
Chenxiao Gao, Chenyang Wu, Mingjun Cao, Rui Kong, Zongzhang Zhang...
TL;DR
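ACT combines Decision Transformer (DT) with dynamic programming to overcome the weaknesses of DT. Through effective trajectory stitching and robust action generation, it copes well with environmental stochasticity and outperforms a range of baseline methods.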
Abstract
Decision Transformer (DT), which employs expressive sequence modeling techniques to perform action generation, has emerged as a promising approach to