Loading

Z-Image Turbo — Open-Source 6B Text-to-Image from Alibaba Tongyi Lab

Introducing Z-Image Turbo

Z-Image Turbo is a 6-billion-parameter text-to-image model from Alibaba's Tongyi Lab — the team behind Qwen — released as open source on November 26, 2025 under the Apache 2.0 license. Distilled to just eight sampling steps via Decoupled-DMD, it generates high-quality images in seconds, delivers native bilingual Chinese and English text rendering, and ranks #1 among open-source image models on the Artificial Analysis leaderboard.

Z-Image Turbo is built on a Scalable Single-Stream DiT (S3-DiT) architecture, in which text, semantic vision tokens, and VAE image tokens are concatenated into a unified input stream — a design Tongyi Lab uses to maximize parameter efficiency at the 6B scale. The base Z-Image model is distilled into Z-Image Turbo via Decoupled-DMD, collapsing inference to eight function evaluations, and aligned to human aesthetic preference with DPO and GRPO. The result is sub-second latency on data-center GPUs and comfortable inference on consumer cards with 16 GB of VRAM.

Native bilingual text rendering is the model's strongest differentiator. Z-Image Turbo handles complex Chinese typography — signage, posters, packaging — alongside English text in the same image, a capability most Western image models still struggle with. As of early 2026, Z-Image Turbo holds the #1 position among open-source image models on the Artificial Analysis Text-to-Image Leaderboard and the top open-source slot on Alibaba AI Arena, with weights freely available on Hugging Face and ModelScope under Apache 2.0 (commercial use permitted).

On LoveGen AI, Z-Image Turbo accepts prompts up to 2000 characters and offers nine preset aspect ratios — 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3, 2:1, and 1:2 — alongside custom dimensions in the 376–1536 pixel range. A deterministic seed parameter (1 to 2,147,483,647) makes outputs reproducible for iteration and batch work. End-to-end generation typically completes in around ten seconds for 0.1 credit per image, making Z-Image Turbo our most cost-efficient text-to-image option — well-suited to high-volume social content, Chinese-language creative work, and rapid concept exploration. Generated image URLs remain valid for 24 hours.

How to Use Z-Image Turbo

01

Write Your Prompt

Describe the image you want in up to 2000 characters. Be specific about subject, style, lighting, and composition for the best results.

02

Pick an Aspect Ratio

Choose one of nine preset aspect ratios that fits your destination — square for social, 16:9 for thumbnails, 9:16 for vertical video covers.

03

Generate & Save

Click Generate. Your image arrives in roughly ten seconds. Download it within 24 hours since the generated link expires after that.

Z-Image Turbo Technical Specifications

DeveloperAlibaba Tongyi Lab (Tongyi-MAI)
Release DateNovember 26, 2025
LicenseApache 2.0 (open-source, commercial use permitted)
ArchitectureScalable Single-Stream DiT (S3-DiT)
Parameters6 billion
Inference Steps8 (distilled via Decoupled-DMD)
ModeText-to-image
Native LanguagesChinese + English text rendering
Estimated Generation Time~10 seconds end-to-end
Prompt LengthUp to 2000 characters
Aspect Ratios1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3, 2:1, 1:2
Custom Dimensions376–1536 px (width × height)
ReproducibilitySeed parameter (1 to 2,147,483,647)
Content ModerationAlways-on baseline + strict NSFW filter
Output Validity24 hours (save outputs promptly)
Cost0.1 credit per image

Why Choose Z-Image Turbo

Native Bilingual Text Rendering

Accurate Chinese and English typography in the same image — including signage, posters, and packaging — a capability most Western image models still lack.

Open-Source by Alibaba Tongyi Lab

Built by the team behind Qwen and released under Apache 2.0 in November 2025. Top-ranked open-source image model on the Artificial Analysis leaderboard.

Distilled 6B Architecture

A Single-Stream DiT (S3-DiT) with 6 billion parameters, distilled to just 8 sampling steps via Decoupled-DMD for sub-second inference on GPU.

Lowest Cost per Image

0.1 credit per generation — LoveGen AI's most efficient text-to-image option for high-volume work.

Reproducible with Seeds

A deterministic seed parameter locks in results. Same prompt plus same seed produces consistent output across runs.

Z-Image Turbo vs Other AI Image Generators

FeatureZ-Image TurboGPT Image 2Flux 2 ProIdeogram v3
DeveloperAlibaba Tongyi LabOpenAIBlack Forest LabsIdeogram
LicenseApache 2.0 (open-source)ClosedClosedClosed
Parameters6BUndisclosedUndisclosedUndisclosed
Primary StrengthBilingual text + open-sourceMulti-image editingStudio qualityTypography & branding
Generation Time~10 seconds~30 seconds~30 seconds~15 seconds
Aspect Ratios9 presets + custom3 presets + autoMultipleMultiple
Custom DimensionsYes (376–1536 px)NoYesLimited
Image InputNoUp to 4 imagesUp to 8 imagesNo
Cost per Image0.1 creditHigherHigherHigher
Best ForBilingual content & fast iterationEditing & blendingStudio workLogos & posters

Popular Uses for Z-Image Turbo

01

Rapid Concept Exploration

Generate many variations quickly to explore visual directions for branding, campaigns, or product ideas.

02

Social Media Content at Scale

Produce posts, stories, and ad creatives in any aspect ratio at low per-image cost for high-volume content schedules.

03

Thumbnails & Banners

Use 16:9 and 9:16 presets for video thumbnails and vertical covers, or custom dimensions for site banners.

Explore Related AI Image Generators

Frequently Asked Questions About Z-Image Turbo

Who built Z-Image Turbo?

Z-Image Turbo was developed by Alibaba's Tongyi Lab — the same team behind the Qwen model family — and released as open source under the Apache 2.0 license on November 26, 2025.

How fast is Z-Image Turbo?

Z-Image Turbo is distilled to just 8 sampling steps via Decoupled-DMD, giving sub-second inference on data-center GPUs. End-to-end on LoveGen AI, generation typically completes in around ten seconds.

Can Z-Image Turbo render Chinese and English text?

Yes — native bilingual text rendering is one of Z-Image Turbo's biggest differentiators. The model handles complex Chinese typography, English text, and mixed-language layouts that many Western image models still struggle with.

What aspect ratios does Z-Image Turbo support?

On LoveGen AI, Z-Image Turbo offers nine preset aspect ratios — 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3, 2:1, and 1:2 — and custom dimensions in the 376–1536 pixel range.

Can I reproduce a specific image with Z-Image Turbo?

Yes. Z-Image Turbo accepts a numeric seed parameter (1 to 2,147,483,647). The same prompt with the same seed produces consistent results, useful for iterating or creating series of related images.

What does Z-Image Turbo cost on LoveGen AI?

Z-Image Turbo costs 0.1 credit per generated image — our most cost-efficient text-to-image model. Generated image URLs remain valid for 24 hours, so download outputs promptly.