WAN v2.2 text-to-image for latest WAN quality.
The best image results come from specific composition, style, and lighting language. Be explicit about what should be in frame and what should feel dominant.
Best results start with a precise subject, composition, and style direction.
WAN v2.2 Text to Image on Pixio is Alibaba's text-to-image model offering solid prompt following and balanced quality for photoreal and stylized outputs. Use it when you want WAN-style generation with good realism and control without needing the latest 2.5/2.6 features.
WAN v2.2 Text to Image on Pixio is Alibaba's text-to-image model offering solid prompt following and balanced quality for photoreal and stylized outputs. Use it when you want WAN-style generation with good realism and control without needing the latest 2.5/2.6 features.
| Mode | Input | Best for |
|---|---|---|
| Text to Image | Prompt only | Scenes, characters, products from a single prompt |
| Option | Values | Notes |
|---|---|---|
| Aspect ratio | 1:1, 16:9, 9:16 (check Pixio) | Match deliverable |
| Credits | Plan-based | Check model card in Pixio |
Credits are plan-based; check the model card in Pixio for your plan and cost per image.
[Subject] + [Composition] + [Lighting] + [Style]. One clear concept per prompt; be specific about pose, setting, and mood.
"Portrait of a businessman in a modern office with floor-to-ceiling windows. Soft daylight, shallow depth of field. Professional, photoreal, 8K."
"A bowl of ramen on a wooden table with steam rising. Close-up, warm lighting. Cozy, appetizing, high detail."
"Fantasy castle on a cliff overlooking a misty valley at sunset. Epic scale, dramatic clouds. Digital painting, cinematic."
"Minimalist product shot of a white wireless earbuds case on grey fabric. Soft studio lighting. Clean, commercial, high-end."
| Scenario | Best choice |
|---|---|
| WAN text-to-image, stable version | WAN v2.2 Text to Image |
| Newer WAN text-to-image | WAN 2.6 Text to Image, WAN 2.5 Text to Image |
| WAN image-to-image | WAN 2.6 Image to Image |
| Flux / Google / Ideogram | Flux Pro, Imagen 4, Ideogram Generate V3 |
Tell the model what should dominate the frame first.
Use lighting language early; it changes everything downstream.
When editing, describe what stays, not just what changes.
References help when continuity matters more than novelty.
A strong image prompt defines the subject, composition, lighting, and finish instead of leaving them implied.
Use precise visual language to control subject, composition, lighting, and style from the start.
Preserve the useful parts of the image while steering the rest with masks, references, or prompt edits.
Bring in reference images or LoRAs when consistency is more important than exploration.
WAN v2.2 Text to Image is strongest when the visual brief is specific about framing, style, and what should read first.
Use it for campaign images, product shots, subject consistency, or polished concept work.
When editing, say exactly what changes and what must remain untouched.
Lock the subject, composition, and lighting direction before you chase style nuance.
Use references or edits when the same subject, style, or layout has to survive across versions.
Once the frame works, refine only the weak areas instead of rewriting the whole composition.
Finish strong compositions by scaling them without rebuilding the frame from scratch.
Use editing tools after the initial generation when the composition is right but the details still need polish.