Splitting the inference model between device, edge server, and cloud can improve the performance of EI greatly. Additionally, the non-orthogonal multiple access (noma), which is the key supporting technologies of B5G/6G, can achieve massive connections and high spectrum efficiency. Mot