BriefGPT.xyz
Feb, 2024
超越最坏情况攻击:非劣政策下的自适应防御强化学习
Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies
HTML
PDF
Xiangyu Liu, Chenghao Deng, Yanchao Sun, Yongyuan Liang, Furong Huang
TL;DR
基于强化学习在现实世界中的蓬勃发展,我们研究了在状态对抗攻击模型下的策略稳健性,并致力于在有限策略类中找到既稳健又高效的近最优解,通过迭代发现非支配策略形成一个最小的近最优解,从而确保在不同攻击场景下的适应性。
Abstract
In light of the burgeoning success of
reinforcement learning
(RL) in diverse real-world applications, considerable focus has been directed towards ensuring RL policies are robust to
adversarial attacks
during tes
→