BriefGPT.xyz
Feb, 2018
公正和多样化的基于DPP的数据概述
Fair and Diverse DPP-based Data Summarization
HTML
PDF
L. Elisa Celis, Vijay Keswani, Damian Straszak, Amit Deshpande, Tarun Kathuria...
TL;DR
通过加入公平性约束条件,该文章提出了一种基于确定性多元分布的方法,并且使用了快速的抽样算法以产出多样化且公平的数据子集。
Abstract
Sampling methods that choose a subset of the data proportional to its diversity in the feature space are popular for
data summarization
. However, recent studies have noted the occurrence of
bias
(under- or over-r
→