BriefGPT.xyz
Oct, 2024
Private Synthetic Text Generation with Diffusion Models
Sebastian Ochs, Ivan Habernal
TL;DR
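This study examines in depth the ability of diffusion models to generate synthetic text under differential privacy. Through extensive experiments, we find that assumptions made in earlier work on private synthetic text generation with LLMs are not met, which may compromise the differential privacy guarantees. Our results also indicate that fully open-source LLMs outperform diffusion models in terms of privacy protection, which provides a useful reference point for future research.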
Abstract
How capable are diffusion models of generating synthetic texts? Recent research shows their strengths, with performance reaching that of auto-regressive LLMs. But are they also good at generating synthetic data if the training was under