Categories: Image

CEO Thoughts: What’s Next at LTX

Zeev, CEO of LTX, here. Wanted to pull back the curtain on the technical bets we’re making and where they’re headed. Happy to go deep in the comments.

We’ve been heads down on the next generation of LTX, and I want to share what’s coming. Not the long-term vision post (that’s coming separately), just a concrete look at what we’re building right now and what you’ll see soon.

The next release of LTX-2 is focused on generation quality across the board. As usual, more data, more compute, and this time around two architectural flavors: a dense model and the mixture-of-experts to accommodate different speed and quality trade-offs.

The mixture-of-experts (MoE) is a fundamental architectural shift where the model activates only the parts it needs for a given generation. This lets us scale capability and quality without paying for it linearly in compute. It’s the kind of change that doesn’t show up in a single demo but fundamentally changes what the model can do at a given cost.

With both dense and MoE, we are going to ship a significantly more capable text encoder. The result is a model that better understands what you wrote, including complex, multi-shot prompts that older architecture tended to flatten or ignore. We are also investing heavily in performance and memory: newer attention kernels and improved low-precision support mean the latest model runs well across a wider range of hardware.

Now, the part I think this community will really care about as well. We’re opening up more of the training infrastructure: new trainer recipes and LoRA training tooling so you can build domain-specific model variants on top of LTX, not just use the base weights as-is. Think specialized flavors for use cases like human motion, product visualization, and architectural environments, each fine-tuned from the same foundation but optimized for a specific domain. On the enterprise side, this extends into a post-training customization layer that lets teams fine-tune on proprietary data without retraining from scratch. The full picture is three tiers: a base foundation model, domain-specific trainer configurations, and a customer customization layer on top.

To be clear: we’re committed to keeping the weights open. The base model, the derivatives, the tooling. This isn’t a bait-and-switch where we open-source early and close up once the model gets good enough to monetize. Openness is how we build, and the community building on top of our models will always reach further than any single team working alone.

One more thing we’re exploring, and we think it could be a real leap in output quality: a diffusion-based decoder that replaces the traditional VAE for converting latents back into pixels. The potential is sharper, higher-resolution output that combines decoding and upscaling into a single step. We’re actively experimenting with it in our latent space. This is the kind of architectural bet that could change the standard of video generation and we hope open models will lead it.

We also know the model is only half the story. There’s still a real gap between “the model works” and “I can ship a finished product on this,” and closing it matters as much to us as any model improvement. We are overhauling our documentation and launching reference implementations to show exactly what good deployment looks like in practice.

More to come soon. In the meantime, tell us what you want us to prioritize.

— Zeev

https://preview.redd.it/mky84vcaop6h1.png?width=1920&format=png&auto=webp&s=67a08c4b282e57a1f465a3e30a38e9df26bf21b8

submitted by /u/ltx_model
[link] [comments]

End-of-January LTX-2 Drop: More Control, Faster Iteration

We just shipped a new LTX-2 drop focused on one thing: making video generation easier to iterate on without killing VRAM, consistency, or sync. If you’ve been frustrated by LTX because prompt iteration was slow or outputs felt brittle, this update is aimed directly at that. Here’s the highlights, the…

January 30, 2026

In "Image"

I’m the Co-founder & CEO of Lightricks. We just open-sourced LTX-2, a production-ready audio-video AI model. AMA.

Hi everyone. I’m Zeev Farbman, Co-founder & CEO of Lightricks. I’ve spent the last few years working closely with our team on LTX-2, a production-ready audio–video foundation model. This week, we did a full open-source release of LTX-2, including weights, code, a trainer, benchmarks, LoRAs, and documentation. Open releases of…

January 9, 2026

In "Image"

LTX-2 Image-to-Video Adapter LoRA

https://huggingface.co/MachineDelusions/LTX-2_Image2Video_Adapter_LoRa A high-rank LoRA adapter for LTX-Video 2 that substantially improves image-to-video generation quality. No complex workflows, no image preprocessing, no compression tricks -- just a direct image embedding pipeline that works. What This Is Out of the box, getting LTX-2 to reliably infer motion from a single image requires…

January 27, 2026

In "Image"

AI Generated Robotic Content