Categories: Image

Text-to-image comparison. FLUX.1 Krea [dev] Vs. Wan2.2-T2V-14B (Best of 5)

Note, this is not a “scientific test” but a best of 5 across both models. So in all 35 images for each so will give a general impression further down.

Exciting that text-to-image is getting some love again. As others have discovered Wan is very good as a image model. So I was trying to get a style which is typically not easy. A type of “boring” TV drama still with a realistic look. I didn’t want to go all action movie like because being able to create more subtle images I find a lot more interesting.

Images alternate between FLUX.1 Krea [dev] first (odd image numbers) then Wan2.2-T2V-14B(even image numbers)

The prompts were longish natural language prompts 150 or so words.

FLUX1. Krea was default settings except for lowering CFG from 3.5 to 2. 25 steps

Wan2.2-T2V-14B was a basic t2v workflow using the Wan21_T2V_14B_lightx2v_cfg_step_distill_lora_rank32 lora at 0.6 stength to speed but that obviusly does have a visual impact (good or bad).

General observations.

The Flux model had a lot more errors, with wonky hands, odd anatomy etc. I’d say 4 out of 5 were very usable from Wan, but only 1 or less was for Flux.

Flux also really didn’t like freckles for some reason. And gave a much more contrasty look which I didn’t ask for however the lighting in general was more accurate for Flux.

Overall I think Wan’s images look a lot more natural in the facial expressions and body language.

Be intersted to hear what you think. I know this isn’t exhaustive in the least but I found it interesting atleast.

submitted by /u/legarth
[link] [comments]

AI Generated Robotic Content

Share
Published by
AI Generated Robotic Content
Tags: ai images

Recent Posts

What tools would you use to make morphing videos like this?

submitted by /u/nikitagent [link] [comments]

17 hours ago

Bias after Prompting: Persistent Discrimination in Large Language Models

A dangerous assumption that can be made from prior work on the bias transfer hypothesis…

17 hours ago

Post-Training Generative Recommenders with Advantage-Weighted Supervised Finetuning

Author: Keertana Chidambaram, Qiuling Xu, Ko-Jen Hsiao, Moumita Bhattacharya(*The work was done when Keertana interned…

17 hours ago

When your AI browser becomes your enemy: The Comet security disaster

Remember when browsers were simple? You clicked a link, a page loaded, maybe you filled…

18 hours ago

Baseus Inspire XC1 Review: Excellent Open Earbuds

These affordable open buds come with Bose-crafted sound.

18 hours ago

DeepMind introduces AI agent that learns to complete various tasks in a scalable world model

Over the past decade, deep learning has transformed how artificial intelligence (AI) agents perceive and…

18 hours ago