BriefGPT.xyz
Apr, 2024
用于大型基于Transformer的模型的高效离群层
Outlier-Efficient Hopfield Layers for Large Transformer-Based Models
HTML
PDF
Jerry Yao-Chieh Hu, Pei-Hsuan Chang, Robin Luo, Hong-Yu Chen, Weijian Li...
TL;DR
我们介绍了一种异常值高效的现代 Hopfield 模型(命名为 OutEffHop),并使用它来解决量化巨大的基于 Transformer 的模型中的异常值引起的挑战。
Abstract
We introduce an
outlier-efficient modern hopfield model
(termed $\mathtt{OutEffHop}$) and use it to address the outlier-induced challenge of
quantizing gigantic transformer-based models
. Our main contribution is
→