BriefGPT.xyz
Mar, 2024
通过基础模型API生成差分隐私合成数据2: 文本
Differentially Private Synthetic Data via Foundation Model APIs 2: Text
HTML
PDF
Chulin Xie, Zinan Lin, Arturs Backurs, Sivakanth Gopi, Da Yu...
TL;DR
我们提出了一种名为Aug-PE的增强版PE算法,应用于文本的复杂情境,通过API访问大型语言模型,生成差分隐私的合成文本,实验证明Aug-PE可以产生具有竞争性效用的差分隐私合成文本,从而便捷地在隐私保护的语言模型应用中提供更可访问的路线。
Abstract
text data
has become extremely valuable due to the emergence of machine learning algorithms that learn from it. A lot of high-quality
text data
generated in the real world is private and therefore cannot be share
→