BriefGPT.xyz
Jun, 2024
An Image is Worth 32 Tokens for Reconstruction and Generation
Qihang Yu, Mark Weber, Xueqing Deng, Xiaohui Shen, Daniel Cremers...
TL;DR
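This paper introduces a Transformer-based 1-Dimensional Tokenizer (TiTok) that tokenizes an image into a 1D latent sequence. By providing a more compact latent representation, it makes image synthesis more efficient and more effective than conventional techniques.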
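To give a sense of how compact a 32-token sequence is, the sketch below compares it with the token count of a conventional 2D grid tokenizer. Only the figure of 32 tokens comes from the title; the 256 x 256 image size, the downsampling factor of 16, and the helper names `grid_token_count` and `compression_ratio` are assumptions chosen for illustration.

```python
# Illustrative arithmetic only. The image size and downsampling factor are
# assumptions for this sketch; only the 32-token figure comes from the title.


def grid_token_count(height: int, width: int, downsample: int) -> int:
    """Tokens emitted by a 2D tokenizer with one token per downsample x downsample cell."""
    if height % downsample or width % downsample:
        raise ValueError("image size must be divisible by the downsampling factor")
    return (height // downsample) * (width // downsample)


def compression_ratio(grid_tokens: int, latent_tokens: int) -> float:
    """How many times shorter the 1D latent sequence is than the 2D token grid."""
    return grid_tokens / latent_tokens


IMAGE_SIZE = 256    # assumed image height and width in pixels
DOWNSAMPLE = 16     # assumed downsampling factor of the 2D baseline
LATENT_TOKENS = 32  # 1D sequence length named in the title

grid_tokens = grid_token_count(IMAGE_SIZE, IMAGE_SIZE, DOWNSAMPLE)
ratio = compression_ratio(grid_tokens, LATENT_TOKENS)
print(f"2D grid tokens: {grid_tokens}")
print(f"1D latent tokens: {LATENT_TOKENS}")
print(f"sequence length reduction: {ratio:.1f}x")
```

Under these assumptions the 2D grid yields 256 tokens, so a 32-token sequence is 8 times shorter. A smaller downsampling factor in the baseline would widen the gap further.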
Abstract
Recent advancements in generative models have highlighted the crucial role of image tokenization in the efficient synthesis of high-resolution images. Tokenization, which transforms images into