This paper introduces a novel approach to membership inference attacks (MIAs) targeting stable diffusion computer vision models, focusing specifically on the highly sophisticated Stable Diffusion V2 by StabilityAI. MIAs aim to extract sensitive information about a model's training data, posing significant privacy concerns. Despite these models' advances in image synthesis, our research reveals privacy vulnerabilities in their outputs. Exploiting these vulnerabilities, we devise a black-box MIA that only needs to query the victim model repeatedly. Our methodology involves observing the output of a stable diffusion model at different generative epochs and training a classification model to distinguish whether or not a series of intermediate outputs originated from a training sample. We propose several ways to measure the membership features and discuss which work best. We assess the attack's efficacy using the ROC AUC method, and it demonstrates a 60\% success rate in inferring membership information. This paper contributes to the growing body of research on privacy and security in machine learning, highlighting the need for robust defenses against MIAs. Our findings prompt a reevaluation of the privacy implications of stable diffusion models, urging practitioners and developers to implement enhanced security measures to safeguard against such attacks.
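
To make the evaluation metric concrete, the following is a minimal sketch of how ROC AUC can be computed from the scores a membership classifier assigns to queried samples. It is an illustration under stated assumptions, not the paper's implementation: the function name `roc_auc`, the label convention (1 for training members, 0 for non-members), and the example scores are all hypothetical.

```python
def roc_auc(labels, scores):
    """ROC AUC via the pairwise (Mann-Whitney) formulation.

    labels: 1 for training members, 0 for non-members (assumed convention).
    scores: attack classifier outputs, higher meaning "more likely a member".
    Returns the fraction of (member, non-member) pairs in which the member
    receives the higher score; tied pairs count as half.
    """
    members = [s for y, s in zip(labels, scores) if y == 1]
    non_members = [s for y, s in zip(labels, scores) if y == 0]
    if not members or not non_members:
        raise ValueError("need at least one member and one non-member")

    wins = 0.0
    for m in members:
        for n in non_members:
            if m > n:
                wins += 1.0
            elif m == n:
                wins += 0.5
    return wins / (len(members) * len(non_members))


# Hypothetical scores for three members and three non-members.
example_labels = [1, 1, 1, 0, 0, 0]
example_scores = [0.9, 0.4, 0.7, 0.5, 0.3, 0.8]
example_auc = roc_auc(example_labels, example_scores)
```

A value of 0.5 corresponds to random guessing and 1.0 to perfect separation of members from non-members, so this quantity summarizes the attack's ranking quality without committing to a single decision threshold.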

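This study introduces a new membership inference attack targeting stable diffusion computer vision models, with particular attention to the highly sophisticated Stable Diffusion V2 developed by StabilityAI. Our research reveals privacy vulnerabilities in the outputs of stable diffusion models; exploiting them, we devise a black-box membership inference attack that only needs to query the victim model repeatedly. The study measures membership features in several ways and discusses which work best. The attack's effectiveness is evaluated using the ROC AUC method, reaching a 60\% success rate in inferring membership information. The paper contributes to research on privacy and security in machine learning and stresses the urgent need for robust defenses against membership inference attacks. Our findings prompt a reevaluation of the privacy implications of stable diffusion models and urge practitioners and developers to adopt enhanced security measures to guard against such attacks.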

Privacy Threats in Stable Diffusion Models