We may have a new SOTA open-source model: ERNIE-Image Comparisons

We may have a new SOTA open-source model: ERNIE-Image Comparisons

Base model is definitely SOTA, can even easily compete with closed-source ones in terms of aesthetic. Cinematic quality and color grading is next level.

Base model is heavily biased on Asian faces, while it excels on anime/illustration style, while my base model anime/illustration experiments wasn’t that good. Higher CFG is slightly better with anime on base.

Generated with RTX6000 Blackwell Pro, Base: 29 sec 1.9it/s, 50 steps | Turbo: 2 sec, 3.9i5/s, 8 steps

If you interested seeing them in original size: https://imgur.com/a/75jcjzW

ComfyUI models: https://huggingface.co/Comfy-Org/ERNIE-Image/tree/main
Workflow should appear in Templates after updating the ComfyUI to latest.

Turbo: Ernie-Image Turbo
Base: Ernie-Image

submitted by /u/sktksm
[link] [comments]