BriefGPT.xyz
Dec, 2021
基于分布式资源感知的异构系统学习
DISTREAL: Distributed Resource-Aware Learning in Heterogeneous Systems
HTML
PDF
Martin Rapp, Ramin Khalili, Kilian Pfeiffer, Jörg Henkel
TL;DR
介绍了一种分布式训练神经网络的资源管理机制——DISTREAL,采用动态dropout机制调整每个设备的计算复杂度,通过Pareto-optimal DSE技术和联邦学习系统,实现了设备自主选择dropout向量来适应不断变化的可用资源,大幅提高了收敛速度而不损失精度。
Abstract
We study the problem of
distributed training
of
neural networks
(NNs) on devices with heterogeneous, limited, and time-varying availability of computational resources. We present an adaptive, resource-aware, on-d
→