| | Installed VibeVoice using the wrapper this dude created. https://www.reddit.com/r/comfyui/comments/1n20407/wip2_comfyui_wrapper_for_microsofts_new_vibevoice/ Workflow is the multi-voice example one can find in the module’s folder. Asked GPT for a harmless talk among those 3 people, used 3 1-minute audio samples, mono, 44KHz .wav Picked the 7B model. My 3060 almost died, took 54 minutes, but she didn’t croak an OOM error, brave girl resisted, and the results are amazing. This is the first one, no edits, no retries. I’m impressed. submitted by /u/nazihater3000 |
ComfyUI-CacheDiT brings 1.4-1.6x speedup to DiT (Diffusion Transformer) models through intelligent residual caching, with zero…
The large language models (LLMs) hype wave shows no sign of fading anytime soon:…
This post was cowritten by Rishi Srivastava and Scott Reynolds from Clarus Care. Many healthcare…
Employee onboarding is rarely a linear process. It’s a complex web of dependencies that vary…
The latest batch of Jeffrey Epstein files shed light on the convicted sex offender’s ties…
A new light-based breakthrough could help quantum computers finally scale up. Stanford researchers created miniature…