BriefGPT.xyz
Jul, 2022
基于联合训练的生成潜空间的强化学习智能体指导的反事实
Outcome-Guided Counterfactuals for Reinforcement Learning Agents from a Jointly Trained Generative Latent Space
HTML
PDF
Eric Yeh, Pedro Sequeira, Jesse Hostetler, Melinda Gervasio
TL;DR
本篇论文提出了一种基于变分自编码器的生成方法,通过特征代表智能体行为的观察值,生成未知而合理的反事实样本,可以提高强化学习代理的决策质量。
Abstract
We present a novel
generative method
for producing unseen and plausible
counterfactual examples
for
reinforcement learning
(RL) agents bas
→