BriefGPT.xyz
Mar, 2025
Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms
Jiaming Song, Linqi Zhou
TL;DR
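This work addresses the stagnation of algorithmic innovation in generative pre-training, which has centered on autoregressive models and diffusion models. It proposes an inference-first approach that prioritizes scaling efficiency at inference time, with the aim of inspiring new generative pre-training algorithms. Through a worked example, it shows that this approach can improve sample quality while substantially improving inference efficiency, advancing multimodal intelligence.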
Abstract
Recent years have seen significant advancements in foundation models through generative pre-training, yet algorithmic innovation in this space has largely stagnated around autoregressive models for discrete signals and