BriefGPT.xyz
Jun, 2024
Adaptive Stochastic Weight Averaging
Caglar Demir, Arnab Sharma, Axel-Cyrille Ngonga Ngomo
TL;DR
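The paper proposes Adaptive Stochastic Weight Averaging (ASWA), a technique that combines Stochastic Weight Averaging (SWA) with early stopping: the running average of the model parameters is updated only when doing so improves generalization performance on the validation dataset. In extensive experiments ranging from image classification to multi-hop reasoning over knowledge graphs, ASWA achieves statistically better generalization across models and datasets.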
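The acceptance rule described in the TL;DR can be illustrated with a minimal Python sketch. It assumes parameters are plain lists of floats and that a higher validation score is better; the names `running_average`, `aswa_step`, and `toy_val_score` are hypothetical, and the acceptance test shown here (strictly better than the best score so far) is a simplification that may differ from the exact criterion used in the paper.

```python
from typing import Callable, List, Tuple

Params = List[float]


def running_average(avg: Params, n: int, new: Params) -> Params:
    """Uniform mean of the n snapshots already in `avg` plus one new snapshot."""
    return [(a * n + w) / (n + 1) for a, w in zip(avg, new)]


def aswa_step(
    avg: Params,
    n: int,
    best_score: float,
    new: Params,
    val_score: Callable[[Params], float],
) -> Tuple[Params, int, float]:
    """Fold `new` into the running average only if validation improves.

    Assumption: "improves" means the candidate average scores strictly
    higher on validation data than the best average seen so far.
    """
    candidate = running_average(avg, n, new)
    score = val_score(candidate)
    if score > best_score:
        return candidate, n + 1, score  # accept the update
    return avg, n, best_score  # reject: keep the previous average


def toy_val_score(params: Params) -> float:
    """Hypothetical validation score: negative squared distance to 1.0."""
    return -sum((p - 1.0) ** 2 for p in params)


# Hypothetical per-epoch parameter snapshots of a one-parameter model.
snapshots = [[0.0], [2.0], [-4.0], [2.0]]

avg, n = snapshots[0], 1
best = toy_val_score(avg)
for weights in snapshots[1:]:
    avg, n, best = aswa_step(avg, n, best, weights, toy_val_score)

print(avg, n)
```

In this toy run only the second snapshot is accepted, so the final average is built from two snapshots. Plain SWA would average all four snapshots regardless of their effect on the validation score.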
Abstract
Ensemble models often improve generalization performances in challenging tasks. Yet, traditional techniques based on prediction averaging incur three well-known disadvantages: the computational overhead of traini