BriefGPT.xyz
Jul, 2020
仅标签成员推断攻击
Label-Only Membership Inference Attacks
HTML
PDF
Christopher A. Choquette Choo, Florian Tramer, Nicholas Carlini, Nicolas Papernot
TL;DR
本文介绍了一种基于标签的会员推理攻击方法,通过对模型预测标签进行扰动来获取精细的成员信号,该攻击未被置换机密度分数的防御措施所挫败,仅采用差分隐私和(强)L2 正则化等防御策略才能有效地抵御所有攻击。
Abstract
membership inference attacks
are one of the simplest forms of privacy leakage for
machine learning models
: given a data point and model, determine whether the point was used to train the model. Existing
→