BriefGPT.xyz
Jul, 2024
AI加速器上基础模型的推理优化
Inference Optimization of Foundation Models on AI Accelerators
HTML
PDF
Youngsuk Park, Kailash Budhathoki, Liangfu Chen, Jonas Kübler, Jiaji Huang...
TL;DR
Transformer架构的大型语言模型和AI加速器的推断优化技术在生成式人工智能中扮演重要角色,并讨论了系统优化、关注力计算和模型压缩等方面的技术。
Abstract
Powerful foundation models, including
large language models
(LLMs), with
transformer architectures
have ushered in a new era of
generative ai
→