Nov, 2024
Circuit Complexity Bounds for RoPE-based Transformer Architecture
Bo Chen, Xiaoyu Li, Yingyu Liang, Jiangxuan Long, Zhenmei Shi...
TL;DR
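This paper studies the expressive power of the Transformer architecture, focusing on Transformers that use Rotary Position Embedding (RoPE). The results show that, under certain conditions, tighter circuit complexity bounds hold for this architecture: although RoPE performs well in practice, its expressive power still has fundamental limits. This offers theoretical guidance for later work on RoPE-based Transformers.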
Abstract
Characterizing the expressive power of the Transformer architecture is critical to understanding its capacity limits and scaling law. Recent works provide the circuit complexity bounds to Transformer-like architectu