Jun, 2021

JIZHI: 用于百度规模在线推理的快速经济的模型即服务系统

TL;DRJIZHI is a Model-as-a-Service system for online real-time inference serving, which employs Staged Event-Driven Pipeline, heterogeneous and hierarchical storage, and an intelligent resource manager to optimize the performance and efficiency of handling huge deep models with sparse parameters, resulting in significant cost savings and increased throughput in Baidu.