The biggest change: we integrated model layer streaming across all local inference pipelines, cutting peak VRAM usage enough to run on 16 GB VRAM machines. This has been one of the most requested changes since launch, and it’s live now.
What else is in 1.0.3:
The VRAM reduction is the one we’re most excited about. The higher VRAM requirement locked out a lot of capable desktop hardware. If your GPU kept you on the sideline, try it now and let us know how it works for you on GitHub.
Already using Desktop? The update downloads automatically.
New here? Download
submitted by /u/ltx_model
[link] [comments]
Ideogram 4 Prompt Builder KJ node rocks. you can make boxes on the canvas and…
This article will teach you how to perform a language task like text classification by…
Today, we are excited to announce the day-zero availability of NVIDIA Nemotron 3 Ultra on…
At Google Cloud, our goal is to let you run large-scale analytical and data science…
The USDA this week confirmed the first known infection of the carnivorous fly larva, which…
A fire alarm jolts you from your office desk, and you head for the nearest…