Categories: Image

Text-to-image comparison. FLUX.1 Krea [dev] Vs. Wan2.2-T2V-14B (Best of 5)

Note, this is not a “scientific test” but a best of 5 across both models. So in all 35 images for each so will give a general impression further down.

Exciting that text-to-image is getting some love again. As others have discovered Wan is very good as a image model. So I was trying to get a style which is typically not easy. A type of “boring” TV drama still with a realistic look. I didn’t want to go all action movie like because being able to create more subtle images I find a lot more interesting.

Images alternate between FLUX.1 Krea [dev] first (odd image numbers) then Wan2.2-T2V-14B(even image numbers)

The prompts were longish natural language prompts 150 or so words.

FLUX1. Krea was default settings except for lowering CFG from 3.5 to 2. 25 steps

Wan2.2-T2V-14B was a basic t2v workflow using the Wan21_T2V_14B_lightx2v_cfg_step_distill_lora_rank32 lora at 0.6 stength to speed but that obviusly does have a visual impact (good or bad).

General observations.

The Flux model had a lot more errors, with wonky hands, odd anatomy etc. I’d say 4 out of 5 were very usable from Wan, but only 1 or less was for Flux.

Flux also really didn’t like freckles for some reason. And gave a much more contrasty look which I didn’t ask for however the lighting in general was more accurate for Flux.

Overall I think Wan’s images look a lot more natural in the facial expressions and body language.

Be intersted to hear what you think. I know this isn’t exhaustive in the least but I found it interesting atleast.

submitted by /u/legarth
[link] [comments]

AI Generated Robotic Content

Share
Published by
AI Generated Robotic Content
Tags: ai images

Recent Posts

New fire just dropped: ComfyUI-CacheDiT ⚡

ComfyUI-CacheDiT brings 1.4-1.6x speedup to DiT (Diffusion Transformer) models through intelligent residual caching, with zero…

15 hours ago

A Beginner’s Reading List for Large Language Models for 2026

  The large language models (LLMs) hype wave shows no sign of fading anytime soon:…

15 hours ago

How Clarus Care uses Amazon Bedrock to deliver conversational contact center interactions

This post was cowritten by Rishi Srivastava and Scott Reynolds from Clarus Care. Many healthcare…

15 hours ago

Build intelligent employee onboarding with Gemini Enterprise

Employee onboarding is rarely a linear process. It’s a complex web of dependencies that vary…

15 hours ago

Epstein Files Reveal Peter Thiel’s Elaborate Dietary Restrictions

The latest batch of Jeffrey Epstein files shed light on the convicted sex offender’s ties…

16 hours ago

A tiny light trap could unlock million qubit quantum computers

A new light-based breakthrough could help quantum computers finally scale up. Stanford researchers created miniature…

16 hours ago