BriefGPT.xyz
May, 2023
不是所有的图像区域都很重要: 用掩码向量量化进行自回归图像生成
Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation
HTML
PDF
Mengqi Huang, Zhendong Mao, Quan Wang, Yongdong Zhang
TL;DR
本研究提出了一种新的两阶段框架,包括掩蔽量化VAE(MQ-VAE)和Stackformer,在图像生成中减轻冗余感知信息的影响,实现了高效有效的图像生成。
Abstract
Existing
autoregressive models
follow the two-stage generation paradigm that first learns a codebook in the
latent space
for image reconstruction and then completes the
→