ai images

PUSA fails go hard

submitted by /u/JackKerawock [link] [comments]

10 months ago

Bytedance release the full safetensor model for UMO – Multi-Identity Consistency for Image Customization . Obligatory beg for a ComfyUI node 🙏🙏

https://huggingface.co/bytedance-research/UMO https://arxiv.org/pdf/2509.06818 Bytedance have released 3 days ago their image editing/creation model UMO. From their huggingface description: Recent advancements in…

10 months ago

RecA: A new finetuning method that doesn’t use image captions.

https://arxiv.org/abs/2509.07295 "We introduce Reconstruction Alignment (RecA), a resource-efficient post-training method that leverages visual understanding encoder embeddings as dense "text prompts,"…

10 months ago

WAN 2.2 Animation – Fixed Slow Motion

I created this animation as part of my tests to find the balance between image quality and motion in low-step…

10 months ago

This sub has had a distinct lack of dancing 1girls lately

So many posts with actual new model releases and technical progression, why can't we go back to the good old…

10 months ago

Just tried HunyuanImage 2.1

Hey guys, I just tested out the new HunyuanImage 2.1 model on HF and… wow. It’s completely uncensored. It even…

10 months ago

Clothes Try On (Clothing Transfer) – Qwen Edit Loraa

Patreon Blog Post CivitAI Download Hey all, as promised here is that Outfit Try On Qwen Image edit LORA I…

10 months ago

How can I do this on Wan Vace?

I know wan can be used with pose estimators for TextV2V, but I'm unsure about reference images to videos. The…

10 months ago

Sydney’s Comfy Tips

Made with Kijai's infiniteTalk workflow and Higgs Audio for the voice. https://github.com/kijai/ComfyUI-WanVideoWrapper/blob/main/example_workflows/wanvideo_I2V_InfiniteTalk_example_02.json https://huggingface.co/bosonai/higgs-audio-v2-generation-3B-base submitted by /u/Race88 [link] [comments]

10 months ago