The biggest change: we integrated model layer streaming across all local inference pipelines, cutting peak VRAM usage enough to run on 16 GB VRAM machines. This has been one of the most requested changes since launch, and it’s live now.
What else is in 1.0.3:
The VRAM reduction is the one we’re most excited about. The higher VRAM requirement locked out a lot of capable desktop hardware. If your GPU kept you on the sideline, try it now and let us know how it works for you on GitHub.
Already using Desktop? The update downloads automatically.
New here? Download
submitted by /u/ltx_model
[link] [comments]
Prime Day is officially over, but many of our favorite, hand-picked deals are still available…
What if your smartwatch could tell when you were struggling emotionally and offer support before…
Picture this: a compliance officer needs a specific clause during an audit, an attorney needs…
As enterprises scale autonomous AI agents into production, enabling safe innovation requires robust architectural guardrails.…
Times are hard in 2026. These Amazon Prime Day deals under $100 on earbuds, Kindles,…