BriefGPT.xyz
Dec, 2023
On the Burstiness of Distributed Machine Learning Traffic
Natchanon Luangsomboon, Fahimeh Fazel, Jörg Liebeherr, Ashkan Sobhani, Shichao Guan...
TL;DR
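This work studies the network traffic characteristics and short-term burstiness of distributed machine learning models. It finds that distributed ML traffic is highly bursty at short time scales, examines burstiness measures across different time scales, and shows the challenges that this traffic poses for congestion control and flow control algorithms.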
Abstract
Traffic from distributed training of machine learning (ML) models makes up a large and growing fraction of the traffic mix in enterprise data centers. While work on distributed ML abounds, the network traffic gen