BriefGPT.xyz
Jun, 2021
带有对抗扰动的公平分类
Fair Classification with Adversarial Perturbations
HTML
PDF
L. Elisa Celis, Anay Mehrotra, Nisheeth K. Vishnoi
TL;DR
本文研究在具有全知对手的情况下的公平分类问题,在此情况下对于一个给定的参数η,对手可以随意选择任意η分数的训练样本并随意扰动它们的保护属性。我们提出了一种优化框架来学习这种对抗情况下的公平分类器,并具有可证明的准确性和公平性保证。
Abstract
We study
fair classification
in the presence of an
omniscient adversary
that, given an $\eta$, is allowed to choose an arbitrary $\eta$-fraction of the training samples and arbitrarily perturb their
→