BriefGPT.xyz
Jun, 2024
线性复杂度语言模型的尺度定律
Scaling Laws for Linear Complexity Language Models
HTML
PDF
Xuyang Shen, Dong Li, Ruitao Leng, Zhen Qin, Weigao Sun...
TL;DR
本研究通过研究线性复杂度语言模型的扩展性建立了基础,并对三种高效的线性架构进行了扩展行为的分析。结果显示,现有的线性复杂度语言模型在扩展能力、语言熟练度和知识保留方面与传统基于transformer的模型相似。
Abstract
The interest in
linear complexity models
for large language models is on the rise, although their scaling capacity remains uncertain. In this study, we present the
scaling laws
for linear complexity language mode
→