BriefGPT.xyz
Jun, 2024
自回归模型胜过扩散模型: Llama用于可扩展图像生成
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
HTML
PDF
Peize Sun, Yi Jiang, Shoufa Chen, Shilong Zhang, Bingyue Peng...
TL;DR
LlamaGen是一种新型的图像生成模型家族,采用大型语言模型中的原始“下一个标记预测”范例应用于视觉生成领域,不附带对视觉信号的归纳偏见,可以在适当缩放的情况下实现最先进的图像生成性能。
Abstract
We introduce
llamagen
, a new family of
image generation
models that apply original ``next-token prediction'' paradigm of large language models to visual generation domain. It is an affirmative answer to whether v
→