BriefGPT.xyz
Feb, 2024
组分布稳健数据集蒸馏及风险最小化
Group Distributionally Robust Dataset Distillation with Risk Minimization
HTML
PDF
Saeed Vahidian, Mingyu Wang, Jianyang Gu, Vyacheslav Kungurtsev, Wei Jiang...
TL;DR
通过结合聚类和风险度量的最小化算法,实现数据集精炼,具备对子群体的有效泛化和稳健性,为解决合成数据集在面对低人口密度地区样本时表现优秀的问题提供了理论依据和数值实验验证。
Abstract
dataset distillation
(DD) has emerged as a widely adopted technique for crafting a
synthetic dataset
that captures the essential information of a training dataset, facilitating the training of accurate
→