BriefGPT.xyz
Jun, 2024
利用深度鲁棒分类器中的边缘一致性检测脆弱决策
Detecting Brittle Decisions for Free: Leveraging Margin Consistency in Deep Robust Classifiers
HTML
PDF
Jonas Ngnawé, Sabyasachi Sahoo, Yann Pequignot, Frédéric Precioso, Christian Gagné
TL;DR
本文引入了边际一致性的概念,该概念将输入空间的边际与鲁棒模型的逻辑边际联系起来,用于高效地检测易受攻击样本和评估部署情景中的对抗脆弱性。
Abstract
Despite extensive research on
adversarial training strategies
to improve
robustness
, the decisions of even the most robust
deep learning models
→