May, 2024

高效的 FP4 混合量化扩散变换器(HQ-DiT)

TL;DRDiffusion Transformers (DiTs) are improved by Hybrid Floating-point Quantization (HQ-DiT), a post-training quantization method utilizing 4-bit floating-point precision on both weights and activations, resulting in low-precision quantization with minimal impact on performance.