Image

I unintentionally scared myself by using the I2V generation model

While experimenting with the video generation model, I had the idea of taking a picture of my room and using…

4 months ago

Hunyuan 3D 2.1 released today – Model, HF Demo, Github links on X

submitted by /u/SysPsych [link] [comments]

4 months ago

Volumetric 3D in ComfyUI , node available !

✨ Introducing ComfyUI-8iPlayer: Seamlessly integrate 8i volumetric videos into your AI workflows! https://github.com/Kartel-ai/ComfyUI-8iPlayer/ Load holograms, animate cameras, capture frames, and…

4 months ago

Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders

submitted by /u/hippynox [link] [comments]

4 months ago

Real time video generation is finally real

Introducing Self-Forcing, a new paradigm for training autoregressive diffusion models. The key to high quality? Simulate the inference process during…

4 months ago

I dunno how to call this lora, UltraReal – Flux.dev lora

Who needs a fancy name when the shadows and highlights do all the talking? This experimental LoRA is the scrappy…

4 months ago

Chatterbox TTS fork *HUGE UPDATE*: 3X Speed increase, Whisper Sync audio validation, text replacement, and more

Check out all the new features here: https://github.com/petermg/Chatterbox-TTS-Extended Just over a week ago Chatterbox was released here: https://www.reddit.com/r/StableDiffusion/comments/1kzedue/mod_of_chatterbox_tts_now_accepts_text_files_as/ I made…

4 months ago

The 8 Rules of Open-Source Generative AI Club!

Fully made with open-source tools within ComfyUI: - Image: UltraReal Finetune (Flux 1 Dev) + Redux + Tyler Durden (Brad…

4 months ago

Elevenlabs v3 is sick

This's going to change the face how audiobooks are made. Hope opensource models catch this up soon! submitted by /u/pheonis2…

4 months ago