BriefGPT.xyz
Nov, 2023
通过自提示校准对精调大型语言模型进行实用的成员推断攻击
Practical Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration
HTML
PDF
Wenjie Fu, Huandong Wang, Chen Gao, Guanghua Liu, Yong Li...
TL;DR
基于自校准概率变异的成员推断攻击(SPV-MIA)提出了一种新的对严格微调但无过拟合和隐私保护的LLMs泄露隐私的成员推断攻击方法。
Abstract
membership inference attacks
(MIA) aim to infer whether a target data record has been utilized for model training or not. Prior attempts have quantified the
privacy risks
of
→