Categories: Image

LTX-2 is amazing: LTX-2 in ComfyUI on RTX 3060 12GB

My setup: RTX 3060 12GB VRAM + 48GB system RAM.

I spent the last couple of days messing around with LTX-2 inside ComfyUI and had an absolute blast. I created short sample scenes for a loose spy story set in a neon-soaked, rainy Dhaka (cyberpunk/Bangla vibes with rainy streets, umbrellas, dramatic reflections, and a mysterious female lead).

Workflow: https://drive.google.com/file/d/1VYrKf7jq52BIi43mZpsP8QCypr9oHtCO/view
I forget the username of the person who shared it under a post, but this workflow worked really well!

Each 8-second scene took about 12 minutes to generate (with synced audio). I queued up 70+ scenes total, often trying 3-4 prompt variations per scene to get the mood right. Some scenes were pure text-to-video, others image-to-video starting from Midjourney stills I generated for consistency.
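Queueing dozens of prompt variations by hand gets tedious; ComfyUI exposes an HTTP endpoint (`POST /prompt` on its local server, port 8188 by default) that accepts API-format workflow JSON, so variations can be scripted. A minimal sketch below, assuming a hypothetical text-encode node ID (`"6"`) and `inputs.text` field; the actual node ID and field name depend on the exported workflow:

```python
import copy
import json
import urllib.request


def make_variants(workflow, node_id, prompts):
    """Return one deep-copied workflow per prompt variation,
    with the text input of the given node swapped out."""
    variants = []
    for text in prompts:
        wf = copy.deepcopy(workflow)
        # Hypothetical node id/field; inspect your exported
        # API-format JSON to find the real ones.
        wf[node_id]["inputs"]["text"] = text
        variants.append(wf)
    return variants


def queue_prompt(workflow, host="127.0.0.1:8188"):
    """Submit one workflow to a running ComfyUI instance."""
    data = json.dumps({"prompt": workflow}).encode("utf-8")
    req = urllib.request.Request(f"http://{host}/prompt", data=data)
    urllib.request.urlopen(req)


if __name__ == "__main__":
    base = json.load(open("ltx2_workflow_api.json"))  # assumed filename
    for wf in make_variants(base, "6", [
        "rainy neon street, Dhaka, umbrellas, reflections",
        "close-up, mysterious female lead, downpour",
    ]):
        queue_prompt(wf)
```

Each queued job then shows up in the ComfyUI queue exactly as if it had been submitted from the UI.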

Here’s a compilation of some of my favorite clips (rainy window reflections, coffee steam morphing into faces, walking through crowded neon markets, intense close-ups in the downpour):

I cleaned up the audio; it had some squeaky sounds.
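For squeaky (high-frequency) artifacts, the basic idea is low-pass filtering. A toy one-pole low-pass in pure Python, just to illustrate the principle (real cleanup would use an audio editor or a proper DSP library, not this):

```python
def one_pole_lowpass(samples, alpha=0.1):
    """Smooth a sample stream with a one-pole low-pass filter.

    alpha in (0, 1]: smaller values cut high frequencies harder.
    High-frequency content (e.g. squeaks) is attenuated while
    slowly varying content passes through nearly unchanged.
    """
    out = []
    y = 0.0
    for x in samples:
        y += alpha * (x - y)  # move a fraction of the way toward x
        out.append(y)
    return out
```

Feeding it a rapidly alternating signal (high frequency) yields a strongly attenuated output, while a constant signal passes through almost intact.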

Strengths that blew me away:

  1. Speed – Seriously fast for what it delivers, especially compared to other local video models.
  2. Audio sync is legitimately impressive. I tested illustration styles, anime-ish looks, realistic characters, and even puppet/weird abstract shapes – lip sync, ambient rain, subtle SFX/music all line up way better than I expected. Achieving this level of quality on just 12GB VRAM is wild.
  3. Handles non-realistic/abstract content extremely well – illustrations, stylized/puppet-like figures, surreal elements (like steam forming faces or exaggerated rain effects) come out coherent and beautiful.

Weaknesses / Things to avoid:

  1. Weird random zoom-in effects pop up sometimes – not sure if prompt-related or model quirk.
  2. Actions/motion-heavy scenes just don’t work reliably yet. Keep it to subtle movements, expressions, atmosphere, rain, steam, walking slowly, etc. – anything dynamic tends to break coherence.

Overall verdict: I literally couldn’t believe how two full days disappeared – I was having way too much fun iterating prompts and watching the queue. LTX-2 feels like a huge step forward for local audio-video gen, especially if you lean into atmospheric/illustrative styles rather than high-action.

submitted by /u/tanzim31

Published by AI Generated Robotic Content
Tags: ai images
