BriefGPT.xyz
Feb, 2025
掩码自编码器是扩散模型有效的标记器
Masked Autoencoders Are Effective Tokenizers for Diffusion Models
HTML
PDF
Hao Chen, Yujin Han, Fangyi Chen, Xiang Li, Yidong Wang...
TL;DR
本研究解决了扩散模型中标记器的潜在空间特性尚未充分探索的问题。提出的MAETok方法通过掩码建模,学习语义丰富的潜在空间,同时保持重建保真度。研究表明,潜在空间的结构对扩散模型的有效性至关重要,提出的方法在图像合成任务中显著提升了生成质量和效率。
Abstract
Recent advances in latent
Diffusion Models
have demonstrated their effectiveness for high-resolution image synthesis. However, the properties of the
Latent Space
from tokenizer for better learning and generation
→