BriefGPT.xyz
Jul, 2023
Dataset Distillation Meets Provable Subset Selection
Murad Tukan, Alaa Maalouf, Margarita Osadchy
TL;DR
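This paper proposes a provable, sampling-based method for initializing the distilled set in dataset distillation. It also combines ideas from data subset selection with dataset distillation, using the notion of the relative contribution of instances to improve performance.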
Abstract
Deep learning has grown tremendously over recent years, yielding state-of-the-art results in various fields. However, training such models requires huge amounts of data, increasing the computational time and cost. To address this,