BriefGPT.xyz
Oct, 2024
使扩散变换器泛化的归纳偏见研究
On Inductive Biases That Enable Generalization of Diffusion Transformers
HTML
PDF
Jie An, De Wang, Pengsheng Guo, Jiebo Luo, Alexander Schwing
TL;DR
本研究解决了基于变换器的去噪网络(如扩散变换器)是否具有可通过几何自适应谐波基表达的归纳偏见的问题。研究发现,改进局部注意力窗口的设置可以显著提高扩散变换器的泛化能力,尤其是在可用训练数据较少的情况下,验证了这些归纳偏见在增强泛化和生成质量方面的作用。
Abstract
Recent work studying the
Generalization
of diffusion models with UNet-based denoisers reveals inductive biases that can be expressed via geometry-adaptive harmonic bases. However, in practice, more recent
Denoising Netw
→