BriefGPT.xyz
Aug, 2022
用于高分辨率、高吞吐率DNN训练的加速器友好型无损图像格式
L3: Accelerator-Friendly Lossless Image Format for High-Resolution, High-Throughput DNN Training
HTML
PDF
Jonghyun Bae, Woohyeon Baek, Tae Jun Ham, Jae W. Lee
TL;DR
介绍一个定制化的L3图像格式,最大限度地减少CPU介入,提高DNN训练过程中的数据准备和整体吞吐量
Abstract
The training process of
deep neural networks
(DNNs) is usually pipelined with stages for
data preparation
on CPUs followed by gradient computation on accelerators like GPUs. In an ideal pipeline, the end-to-end t
→