BriefGPT.xyz
Jul, 2021
识别领域外动态:基于强化学习的基准和实验结果
Out-of-Distribution Dynamics Detection: RL-Relevant Benchmarks and Results
HTML
PDF
Mohamad H Danesh, Alan Fern
TL;DR
该研究设计了一组来自常见强化学习环境的OODD基准,并基于循环隐式分位数网络(RIQN)设计了一种强的OODD基线方法,以监测自回归预测误差来检测OODD,并介绍和测试了其他三种基线方法。
Abstract
We study the problem of
out-of-distribution dynamics
(OODD) detection, which involves detecting when the dynamics of a
temporal process
change compared to the training-distribution dynamics. This is relevant to a
→