BriefGPT.xyz
May, 2023
使用邻居比较攻击语言模型的成员推断
Membership Inference Attacks against Language Models via Neighbourhood Comparison
HTML
PDF
Justus Mattern, Fatemehsadat Mireshghallah, Zhijing Jin, Bernhard Schölkopf, Mrinmaya Sachan...
TL;DR
本研究探讨了参考模型攻击在更现实的情况下对数据分布的脆弱性,并提出并评估了领域攻击方法,以提高模型隐私性。
Abstract
membership inference attacks
(MIAs) aim to predict whether a data sample was present in the training data of a machine learning model or not, and are widely used for assessing the
privacy risks
of
→