TL;DR提出了一种结合两种可解释强化学习技术的方法,名为 XRL-DINE,可用于解释具有设计时间不确定性的自适应系统中的 Deep RL 决策。
Abstract
Design time uncertainty poses an important challenge when developing a
self-adaptive system. As an example, defining how the system should adapt when
facing a new environment state, requires understanding the precise effect of an
adaptation, which may not be known at design time. Onlin