BriefGPT.xyz
Feb, 2021
强化学习中防御奖励中毒攻击
Defense Against Reward Poisoning Attacks in Reinforcement Learning
HTML
PDF
Kiarash Banihashem, Adish Singla, Goran Radanovic
TL;DR
本文提出了防御策略,针对强化学习中的奖励污染攻击,并使用优化框架和性能保证来设计对抗策略。
Abstract
We study
defense strategies
against
reward poisoning attacks
in
reinforcement learning
. As a threat model, we consider attacks that minima
→