
End-of-January LTX-2 Drop: More Control, Faster Iteration

We just shipped a new LTX-2 drop focused on one thing: making video generation easier to iterate on without killing VRAM, consistency, or sync.

If you’ve been frustrated by LTX because prompt iteration was slow or outputs felt brittle, this update is aimed directly at that.

Here are the highlights; the full details are here.

What’s New

Faster prompt iteration (Gemma text encoding nodes)
Why you should care: no more constant VRAM loading and unloading on consumer GPUs.

New ComfyUI nodes let you save and reuse text encodings, or run Gemma encoding through our free API when running LTX locally.

This makes Detailer and iterative flows much faster and less painful.
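The idea behind reusable text encodings is simple: the prompt embedding doesn't change between iterations, so it only needs to be computed once. A minimal sketch of that caching pattern, with `fake_gemma_encode` standing in for the real Gemma encoder (the actual node names and APIs are not shown here):

```python
import hashlib
from pathlib import Path

import numpy as np

CACHE_DIR = Path("text_enc_cache")
CACHE_DIR.mkdir(exist_ok=True)

def fake_gemma_encode(prompt: str) -> np.ndarray:
    """Stand-in for the Gemma text encoder: a deterministic pseudo-embedding."""
    seed = int.from_bytes(hashlib.sha256(prompt.encode()).digest()[:4], "big")
    rng = np.random.default_rng(seed)
    return rng.standard_normal((77, 768)).astype(np.float32)

def encode_cached(prompt: str) -> np.ndarray:
    """Encode once, then reuse from disk — the encoder never has to be reloaded."""
    key = hashlib.sha256(prompt.encode()).hexdigest()[:16]
    path = CACHE_DIR / f"{key}.npy"
    if path.exists():
        return np.load(path)            # cache hit: no encoder, no VRAM churn
    emb = fake_gemma_encode(prompt)     # first call only: run the encoder
    np.save(path, emb)
    return emb

a = encode_cached("a red fox running through snow")
b = encode_cached("a red fox running through snow")  # served from disk
```

Iterating on the sampler, a Detailer pass, or LoRA weights then never touches the text encoder again, which is what removes the load/unload cycle on consumer GPUs.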

Independent control over prompt accuracy, stability, and sync (Multimodal Guider)
Why you should care: you can now tune quality without breaking something else.

The new Multimodal Guider lets you control:

  • Prompt adherence
  • Visual stability over time
  • Audio-video synchronization

Each can be tuned independently, per modality. No more choosing between “follows the prompt” and “doesn’t fall apart.”
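The post doesn't spell out the Guider's math, but a common way to get independent per-modality control is a classifier-free-guidance-style combination where each conditioning signal contributes its own direction with its own scale. A sketch under that assumption (function and parameter names are illustrative, not the actual node API):

```python
import numpy as np

def multimodal_guide(eps_uncond, eps_text, eps_sync, text_scale, sync_scale):
    """CFG-style blend: each modality adds its own guidance direction,
    scaled independently, on top of the unconditional prediction."""
    return (eps_uncond
            + text_scale * (eps_text - eps_uncond)
            + sync_scale * (eps_sync - eps_uncond))

rng = np.random.default_rng(0)
u, t, s = (rng.standard_normal(4) for _ in range(3))

# text_scale=1, sync_scale=0 reduces to plain text conditioning:
only_text = multimodal_guide(u, t, s, 1.0, 0.0)
```

Because each scale multiplies its own residual term, raising prompt adherence doesn't force a change in sync strength, and vice versa.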

More practical fine-tuning + faster inference
Why you should care: better behavior on real hardware.

Trainer updates improve memory usage and make fine-tuning more predictable on constrained GPUs.

Video-to-video inference is also faster: the reference video is downscaled before cross-attention, reducing compute cost. (Speedups depend on resolution and clip length.)
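Why downscaling the reference helps: cross-attention cost scales with the number of key/value tokens, which come from the reference video. A back-of-the-envelope sketch (the sizes below are illustrative, not LTX-2's actual latent dimensions):

```python
def xattn_cost(q_tokens: int, kv_tokens: int, dim: int) -> int:
    """Rough FLOP count of one cross-attention layer: QK^T plus attn @ V."""
    return 2 * q_tokens * kv_tokens * dim

# Illustrative latent-space sizes for a generated clip and its reference.
frames, h, w, dim = 49, 64, 96, 128
q = frames * h * w                       # tokens in the clip being generated

full = xattn_cost(q, frames * h * w, dim)              # full-res reference
half = xattn_cost(q, frames * (h // 2) * (w // 2), dim)  # 2x downscaled

# Halving each spatial dimension quarters the KV tokens, and cross-attention
# cost drops proportionally.
print(f"cross-attention cost ratio: {half / full:.0%}")
```

Halving the reference's spatial resolution cuts KV tokens 4x, so each cross-attention layer costs roughly a quarter as much, which is where the video-to-video speedup comes from.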

We’ve also shipped new ComfyUI nodes and a unified LoRA to support these changes.

What’s Next

This drop isn’t a one-off. The next LTX-2 version is already in progress, focused on:

  • Better fine detail and visual fidelity (new VAE)
  • Improved consistency to conditioning inputs
  • Cleaner, more reliable audio
  • Stronger image-to-video behavior
  • Better prompt understanding and color handling

More on what’s coming up here.

Try It and Stress It!

If you’re pushing LTX-2 in real workflows, your feedback directly shapes what we build next. Try the update, break it, and tell us what still feels off in our Discord.

submitted by /u/ltx_model
