Dec, 2023
Moirai:面向异构设备的分布式推理优化放置
Moirai: Towards Optimal Placement for Distributed Inference on Heterogeneous Devices
Beibei Zhang, Hongwei Zhu, Feng Gao, Zhihui Yang, Sean Xiaoyang Wang
TL;DRMoirai is a device placement algorithm that leverages runtime inter-operator fusion to render a coarsened computation graph, reducing the search space and improving end-to-end inference performance for Deep Neural Networks.