BriefGPT.xyz
Mar, 2022
使用隐私联邦学习免费训练一个分词工具
Training a Tokenizer for Free with Private Federated Learning
HTML
PDF
Eugene Bagdasaryan, Congzheng Song, Rogier van Dalen, Matt Seigel, Áine Cahill
TL;DR
本文针对模型中分布式设备上分布的私密数据进行保护的隐私联邦学习中,训练分词器通常需要使用没有额外隐私预算的方法才能成功进行,而本文提出了一种新方法来训练分词器,保证隐私且在性能上与未经隐私保护的分词器相媲美。
Abstract
federated learning
with
differential privacy
, i.e. private
federated learning
(PFL), makes it possible to train models on private data dis
→