BriefGPT.xyz
Nov, 2022
可证明的强化学习后门政策防御
Provable Defense against Backdoor Policies in Reinforcement Learning
HTML
PDF
Shubham Kumar Bharti, Xuezhou Zhang, Adish Singla, Xiaojin Zhu
TL;DR
该研究提出了一种基于子空间触发假设的强化学习背门策略的可证明防御机制,该机制通过将观察到的状态投射到一个安全子空间来消毒被污染的策略,从而实现了近似最优性。
Abstract
We propose a provable
defense mechanism
against
backdoor policies
in
reinforcement learning
under
→