BriefGPT.xyz
Jun, 2024
具有动机的对手:对抗鲁棒性的战略性替代方案
Adversaries With Incentives: A Strategic Alternative to Adversarial Robustness
HTML
PDF
Maayan Ehrenberg, Roy Ganz, Nir Rosenfeld
TL;DR
通过战略建模,我们的研究提出使用对手的动机作为归纳偏差学习的一种方式,通过战略训练在不确定奖励条件下防御对手,此方法甚至对对手动机的轻微了解也能有用,潜在收益程度取决于动机与学习任务结构的关系。
Abstract
adversarial training
aims to defend against *
adversaries
*: malicious opponents whose sole aim is to harm predictive performance in any way possible - a rather harsh perspective, which we assert results in unneces
→