by nunchaku-ai
Open source · 8k downloads · 257 likes
Nunchaku Qwen Image is a text-to-image generation model optimized for transforming descriptions into high-quality visuals. It stands out for its ability to accurately render complex text embedded within images while maintaining strong performance through advanced quantization techniques. Its lightweight versions, available in 4 or 8 generation steps, strike a balance between speed and quality, making them suitable for both consumer and professional applications. The model excels in energy efficiency and compatibility with the latest GPU architectures, ensuring accessibility even on limited hardware configurations. Ideal for content creators, developers, or businesses looking to automate image production from text prompts.

This repository contains Nunchaku-quantized versions of Qwen-Image, designed to generate high-quality images from text prompts, advances in complex text rendering. It is optimized for efficient inference while maintaining minimal loss in performance.
Data Type: INT4 for non-Blackwell GPUs (pre-50-series), NVFP4 for Blackwell GPUs (50-series).
Rank: r32 for faster inference, r128 for better quality but slower inference.
Standard inference speed models for general use
| Data Type | Rank | Model Name | Comment |
|---|---|---|---|
| INT4 | r32 | svdq-int4_r32-qwen-image.safetensors | |
| r128 | svdq-int4_r128-qwen-image.safetensors | ||
| NVFP4 | r32 | svdq-fp4_r32-qwen-image.safetensors | |
| r128 | svdq-fp4_r128-qwen-image.safetensors |
4-step distilled models fused with Qwen-Image-Lightning-4steps-V1.0 LoRA using LoRA strength = 1.0
| Data Type | Rank | Model Name | Comment |
|---|---|---|---|
| INT4 | r32 | svdq-int4_r32-qwen-image-lightningv1.0-4steps.safetensors | Fused with Qwen-Image-Lightning-4steps-V1.0 LoRA |
| r128 | svdq-int4_r128-qwen-image-lightningv1.0-4steps.safetensors | Fused with Qwen-Image-Lightning-4steps-V1.0 LoRA. Better quality, slower inference | |
| NVFP4 | r32 | svdq-fp4_r32-qwen-image-lightningv1.0-4steps.safetensors | Fused with Qwen-Image-Lightning-4steps-V1.0 LoRA |
| r128 | svdq-fp4_r128-qwen-image-lightningv1.0-4steps.safetensors | Fused with Qwen-Image-Lightning-4steps-V1.0 LoRA. Better quality, slower inference |
8-step distilled models fused with Qwen-Image-Lightning-8steps-V1.1 LoRA using LoRA strength = 1.0
| Data Type | Rank | Model Name | Comment |
|---|---|---|---|
| INT4 | r32 | svdq-int4_r32-qwen-image-lightningv1.1-8steps.safetensors | Fused with Qwen-Image-Lightning-8steps-V1.1 LoRA |
| r128 | svdq-int4_r128-qwen-image-lightningv1.1-8steps.safetensors | Fused with Qwen-Image-Lightning-8steps-V1.1 LoRA. Better quality, slower inference | |
| NVFP4 | r32 | svdq-fp4_r32-qwen-image-lightningv1.1-8steps.safetensors | Fused with Qwen-Image-Lightning-8steps-V1.1 LoRA |
| r128 | svdq-fp4_r128-qwen-image-lightningv1.1-8steps.safetensors | Fused with Qwen-Image-Lightning-8steps-V1.1 LoRA. Better quality, slower inference |

@inproceedings{
li2024svdquant,
title={SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models},
author={Li*, Muyang and Lin*, Yujun and Zhang*, Zhekai and Cai, Tianle and Li, Xiuyu and Guo, Junxian and Xie, Enze and Meng, Chenlin and Zhu, Jun-Yan and Han, Song},
booktitle={The Thirteenth International Conference on Learning Representations},
year={2025}
}