BriefGPT.xyz
Oct, 2023
Tackling the Matrix Multiplication Micro-kernel Generation with Exo
Adrián Castelló, Julian Bellavita, Grace Dinh, Yuka Ikarashi, Héctor Martínez
TL;DR
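Optimizing matrix multiplication (GEMM) has been a persistent need over recent decades. This work presents a step-by-step procedure for generating micro-kernels with the Exo compiler. The generated micro-kernels perform close to (or even better than) micro-kernels developed manually with intrinsics or assembly code, while improving the portability of the generated code.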
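For readers unfamiliar with the term, the following is a minimal sketch of what a GEMM micro-kernel computes: a small tile of C updated from a panel of A and a panel of B through a sequence of rank-1 updates. It is plain Python for illustration only. It is not Exo code and not the authors' generated kernel; the function name `gemm_microkernel` and the parameters `mr`, `nr`, `kc` are assumptions chosen here to follow common GEMM naming.

```python
def gemm_microkernel(mr, nr, kc, a_panel, b_panel, c_tile):
    """Illustrative reference micro-kernel (hypothetical helper, not Exo output).

    Computes c_tile (mr x nr) += a_panel (mr x kc) @ b_panel (kc x nr)
    as kc successive rank-1 updates, which is the loop structure that
    hand-written or generated micro-kernels typically vectorize.
    """
    for p in range(kc):              # one rank-1 update per step of the k loop
        for i in range(mr):
            a_ip = a_panel[i][p]     # element of A reused across a row of the C tile
            for j in range(nr):
                c_tile[i][j] += a_ip * b_panel[p][j]
    return c_tile


# Usage: a 2 x 2 tile with kc = 2
a = [[1.0, 2.0], [3.0, 4.0]]
b = [[5.0, 6.0], [7.0, 8.0]]
c = [[0.0, 0.0], [0.0, 0.0]]
result = gemm_microkernel(2, 2, 2, a, b, c)
print(result)  # [[19.0, 22.0], [43.0, 50.0]]
```

An optimized micro-kernel keeps the C tile in vector registers and replaces the two inner loops with vector instructions. That register-level code is the part usually written by hand with intrinsics or assembly, and it is the part the summary above describes as being generated instead.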
Abstract
The optimization of the matrix multiplication (or gemm) has been a need during the last decades. This operation is considered the flagship …