BriefGPT.xyz
May, 2023
动态变压器提供了一种虚假的效率感
Dynamic Transformers Provide a False Sense of Efficiency
HTML
PDF
Yiming Chen, Simin Chen, Zexin Li, Wei Yang, Cong Liu...
TL;DR
本文提出了一种名为SAME的攻击框架,重点针对多出口模型的内部预测,有效地降低了各种多出口模型的效率,验证了其有效性和泛化能力。
Abstract
Despite much success in natural language processing (
nlp
),
pre-trained language models
typically lead to a high computational cost during inference.
→