Feb, 2025
C2T: A Classifier-Based Tree Construction Method in Speculative Decoding
Feiye Huo, Jianchao Tan, Kefeng Zhang, Xunliang Cai, Shengli Sun
TL;DR
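Current speculative decoding methods are inefficient at constructing token trees and verifying candidate tokens. This work proposes C2T, a method that uses a lightweight classifier to dynamically generate and prune the token tree. Across multiple benchmarks, C2T reduces the total number of candidate tokens by 25% while maintaining or improving the acceptance length.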
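To make the idea of classifier-guided tree pruning concrete, here is a minimal Python sketch. It is not the authors' implementation: the features (joint probability and depth), the logistic weights, the threshold, and the `toy_draft` stand-in for a draft model are all hypothetical choices made for illustration only.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Node:
    token: str
    joint_prob: float  # product of draft probabilities along the path
    depth: int
    children: list = field(default_factory=list)


def keep_score(joint_prob, depth, w=(6.0, -0.3), b=-1.0):
    """Hypothetical logistic classifier; the weights are made up, not learned."""
    z = w[0] * joint_prob + w[1] * depth + b
    return 1.0 / (1.0 + math.exp(-z))


def toy_draft(prefix):
    """Stand-in for a draft model: always proposes the same three candidates."""
    return [("a", 0.6), ("b", 0.3), ("c", 0.1)]


def build_tree(max_depth=3, threshold=0.5):
    """Grow the token tree layer by layer, dropping low-scoring candidates."""
    root = Node("<root>", 1.0, 0)
    frontier = [(root, [])]
    for depth in range(1, max_depth + 1):
        next_frontier = []
        for parent, prefix in frontier:
            for tok, p in toy_draft(prefix):
                jp = parent.joint_prob * p
                if keep_score(jp, depth) < threshold:
                    continue  # pruned: never added to the tree, never expanded
                child = Node(tok, jp, depth)
                parent.children.append(child)
                next_frontier.append((child, prefix + [tok]))
        frontier = next_frontier
    return root


def count_nodes(node):
    """Count all nodes in the tree, including the root."""
    return 1 + sum(count_nodes(c) for c in node.children)


if __name__ == "__main__":
    print("pruned tree size:  ", count_nodes(build_tree()))
    print("unpruned tree size:", count_nodes(build_tree(threshold=0.0)))
```

Setting `threshold=0.0` disables pruning, so comparing the two node counts shows how many candidates the classifier removes before they would reach verification in this toy setting.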
Abstract
The growing scale of Large Language Models (LLMs) has exacerbated inference latency and computational costs. Speculative Decoding methods, which aim to mitigate these issues, often face inefficiencies in the cons