by T5B
Open source · 92k downloads · 159 likes
Z Image Turbo FP8 is an optimized version of the Z-Image-Turbo model with reduced precision (FP8), designed to deliver enhanced performance while minimizing memory usage. This model leverages two quantization formats (E5M2 and E4M3FN) to accelerate inference while maintaining high visual quality, making it ideal for resource-constrained environments. It excels in generating and editing images from text prompts with superior speed and energy efficiency compared to standard-precision versions. Key use cases include creating images for marketing, digital art, or rapid prototyping, as well as integration into applications requiring short response times. What sets it apart is its ability to balance performance and lightweight design while adhering to the original model’s usage constraints.
This is a quantization of Tongyi-MAI/Z-Image-Turbo to FP8 E5M2 and FP8 E4M3FN.
License & Usage: This model strictly follows the original licensing terms and usage restrictions. Please refer to the original model card for details.