AI/EXPLORER
ToolsCategoriesSitesLLMsCompareAI QuizAlternativesPremium
—AI Tools
—Sites & Blogs
—LLMs & Models
—Categories
AI Explorer

Find and compare the best artificial intelligence tools for your projects.

Made within France

Explore

  • ›All tools
  • ›Sites & Blogs
  • ›LLMs & Models
  • ›Compare
  • ›Chatbots
  • ›AI Images
  • ›Code & Dev

Company

  • ›Premium
  • ›About
  • ›Contact
  • ›Blog

Legal

  • ›Legal notice
  • ›Privacy
  • ›Terms

© 2026 AI Explorer·All rights reserved.

HomeLLMsZ Image Turbo GGUF

Z Image Turbo GGUF

by jayn7

Open source · 40k downloads · 348 likes

3.2
(348 reviews)ImageAPI & Local
About

Z Image Turbo GGUF is a quantized version of the Z-Image Turbo model, specifically optimized for efficient use with limited resources. This model excels in generating and editing images from text descriptions, delivering fast and high-quality results. Its core capabilities include creating realistic images, modifying existing visual elements, and transforming sketches into detailed images. It is particularly well-suited for digital artists, multimedia application developers, and content creators looking to automate or enhance their visual creation process. What sets it apart is its balance between performance and accessibility, enabling seamless integration into various creative workflows thanks to its compatibility with tools like ComfyUI and Diffusers.

Documentation

Quantized GGUF versions of the Z-Image Turbo by Tongyi-Mai.

📂 Available Models

ModelDownload
Z-Image Turbo GGUFDownload
Qwen3-4B (Text Encoder)unsloth/Qwen3-4B-GGUF

📷 Example Comparison

z_image_comparison_1 z_image_comparison_2 z_image_comparison_3

Model Information

Check out the original model card Z-Image Turbo for detailed information about the model.

Usage

The model can be used with:

  • ComfyUI-GGUF by city96
  • Diffusers

Example Usage

Diffusers
Sh
pip install git+https://github.com/huggingface/diffusers
Py
from diffusers import ZImagePipeline, ZImageTransformer2DModel, GGUFQuantizationConfig
import torch

prompt = "Young Chinese woman in red Hanfu, intricate embroidery. Impeccable makeup, red floral forehead pattern. Elaborate high bun, golden phoenix headdress, red flowers, beads. Holds round folding fan with lady, trees, bird. Neon lightning-bolt lamp (⚡️), bright yellow glow, above extended left palm. Soft-lit outdoor night background, silhouetted tiered pagoda (西安大雁塔), blurred colorful distant lights."
height = 1024
width = 1024
seed = 42

#hf_path = "https://huggingface.co/jayn7/Z-Image-Turbo-GGUF/blob/main/z_image_turbo-Q3_K_M.gguf"
local_path = "path\to\local\model\z_image_turbo-Q3_K_M.gguf"

transformer = ZImageTransformer2DModel.from_single_file(
    local_path,
    quantization_config=GGUFQuantizationConfig(compute_dtype=torch.bfloat16),
    dtype=torch.bfloat16,
)

pipeline = ZImagePipeline.from_pretrained(
    "Tongyi-MAI/Z-Image-Turbo",
    transformer=transformer,
    dtype=torch.bfloat16,
).to("cuda")

# [Optional] Attention Backend
# Diffusers uses SDPA by default. Switch to Custom attention backend for better efficiency if supported:
#pipeline.transformer.set_attention_backend("_sage_qk_int8_pv_fp16_triton") # Enable Sage Attention
#pipeline.transformer.set_attention_backend("flash") # Enable Flash-Attention-2
#pipeline.transformer.set_attention_backend("_flash_3") # Enable Flash-Attention-3

# [Optional] Model Compilation
# Compiling the DiT model accelerates inference, but the first run will take longer to compile.
#pipeline.transformer.compile()

# [Optional] CPU Offloading
# Enable CPU offloading for memory-constrained devices.
#pipeline.enable_model_cpu_offload()

images = pipeline(
    prompt=prompt,
    num_inference_steps=9, # This actually results in 8 DiT forwards
    guidance_scale=0.0, # Guidance should be 0 for the Turbo models
    height=height,
    width=width,
    generator=torch.Generator("cuda").manual_seed(seed)
).images[0]

images.save("zimage.png")

Credits

  • Original Model: Z-Image Turbo by Tongyi-MAI
  • Quantization Tools & Guide: llama.cpp & city96

License

This repository follows the same license as the Z-Image Turbo.

Capabilities & Tags
gguftext-to-imageimage-generation
Links & Resources
Specifications
CategoryImage
AccessAPI & Local
LicenseOpen Source
PricingOpen Source
Rating
3.2

Try Z Image Turbo GGUF

Access the model directly