BriefGPT.xyz
Nov, 2019
MANGA: 方法无关的神经策略泛化与适应
MANGA: Method Agnostic Neural-policy Generalization and Adaptation
HTML
PDF
Homanga Bharadhwaj, Shoichiro Yamaguchi, Shin-ichi Maeda
TL;DR
该论文介绍了一种名为MANGA的神经策略泛化和适应方法,通过分离策略学习和系统识别的过程,将学习到的策略有效地转移到具有不同动态参数和电机噪声变化的未知环境中,我们通过4个不同的MuJoCo代理实验来证明了该方法的有效性。
Abstract
In this paper we target the problem of transferring policies across multiple environments with different dynamics parameters and
motor noise variations
, by introducing a framework that decouples the processes of
policy
→