BriefGPT.xyz
Oct, 2020
低数据迁移学习的深度集成
Deep Ensembles for Low-Data Transfer Learning
HTML
PDF
Basil Mustafa, Carlos Riquelme, Joan Puigcerver, andAndré Susano Pinto, Daniel Keysers...
TL;DR
本文探讨了从预训练模型中创建集成模型的不同方法,并提出了一种有效的算法来识别下游数据集的预训练模型子集。在19项下游任务中(视觉任务适应基准),即使从超过2000个预训练模型中进行选择,其实现了具有较低推理预算的最先进性能,并且在ImageNet变体上对分布转移具有更好的鲁棒性。
Abstract
In the low-data regime, it is difficult to train good supervised models from scratch. Instead practitioners turn to
pre-trained models
, leveraging
transfer learning
. Ensembling is an empirically and theoretically
→