Hunyuan 3D 2.1 released today – Model, HF Demo, Github links on X
submitted by /u/SysPsych [link] [comments]
✨ Introducing ComfyUI-8iPlayer: Seamlessly integrate 8i volumetric videos into your AI workflows! https://github.com/Kartel-ai/ComfyUI-8iPlayer/ Load holograms, animate cameras, capture frames, and feed them to your favorite AI models. The future of 3D content creation is here! Developed by me for Kartel.ai 🚀 Note: There might be a few bugs, but I hope people can play with it! #AI …
submitted by /u/hippynox [link] [comments]
Introducing Self-Forcing, a new paradigm for training autoregressive diffusion models. The key to high quality? Simulate the inference process during training by unrolling transformers with KV caching. project website: https://self-forcing.github.io Code/models: https://github.com/guandeh17/Self-Forcing Source: https://x.com/xunhuang1995/status/1932107954574275059?t=Zh6axAeHtYJ8KRPTeK1T7g&s=19 submitted by /u/cjsalva [link] [comments]
submitted by /u/hippynox [link] [comments]
Who needs a fancy name when the shadows and highlights do all the talking? This experimental LoRA is the scrappy cousin of my Samsung one—same punchy light-and-shadow mojo, but trained on a chaotic mix of pics from my ancient phones (so no Samsung for now). You can check it here: https://civitai.com/models/1662740?modelVersionId=1881976 submitted by /u/FortranUA [link] …
Read more “I dunno how to call this lora, UltraReal – Flux.dev lora”
Check out all the new features here: https://github.com/petermg/Chatterbox-TTS-Extended Just over a week ago Chatterbox was released here: https://www.reddit.com/r/StableDiffusion/comments/1kzedue/mod_of_chatterbox_tts_now_accepts_text_files_as/ I made a couple of posts about the fork I had been working on, but this update is even bigger than before. submitted by /u/omni_shaNker [link] [comments]
Fully made with open-source tools within ComfyUI:
– Image: UltraReal Finetune (Flux 1 Dev) + Redux + Tyler Durden (Brad Pitt) Lora > Flux Fill Inpaint
– Video Model: Wan 2.1 Fun Control 14B + DW Pose*
– Upscaling: 2xNomosUNI esrgan + Wan 2.1 T2V 1.3B (low denoise)
– Interpolation: Rife 47
– Voice …
This is going to change how audiobooks are made. Hope open-source models catch up soon! submitted by /u/pheonis2 [link] [comments]
I’ve been active on this sub basically since SD 1.5, and whenever something new comes out that ranges from “doesn’t totally suck” to “Amazing,” it gets wall to wall threads blanketing the entire sub during what I’ve come to view as a new model “Honeymoon” phase. All a model needs to get this kind of …