BriefGPT.xyz
Jun, 2022
可学习、值得学习且尚未学习的点的优先训练
Prioritized Training on Points that are Learnable, Worth Learning, and Not Yet Learnt
HTML
PDF
Sören Mindermann, Jan Brauner, Muhammed Razzak, Mrinank Sharma, Andreas Kirsch...
TL;DR
使用可减少示例并且减少噪点的筛选技术进行训练能够减小无关点对模型学习的干扰。在类似RHO-LOSS这样可削减的示例中训练的时间比现有技术短得多,提高了准确性,并在广泛的数据集、超参数和架构中加快了训练
Abstract
training
on web-scale data can take months. But much computation and time is wasted on redundant and noisy points that are already learnt or not learnable. To accelerate
training
, we introduce
→