Categories: Image

Turns out LTX-2 makes a very good video upscaler for WAN

I have had a lot of fun with LTX but for a lot of usecases it is useless for me. for example this usecase where I could not get anything proper with LTX no matter how much I tried (mild nudity):
https://aurelm.com/portfolio/ode-to-the-female-form/
The video may be choppy on the site but you can download it locally. Looks quite good to me and also gets rid of the warping and artefacts from wan and the temporal upscaler also does a damn good job.
First 5 shots were upscaled from 720p to 1440p and the rest are from 440p to 1080p (that’s why they look worse). No upscaling outside Comfy was used.

workwlow in my blog post below. I could not get a proper link of the 2 steps in one run (OOM) so the first group is for wan, second you load the wan video and run with only the second group active.
https://aurelm.com/2026/02/22/using-ltx-2-as-an-upscaler-temporal-and-spatial-for-wan-2-2/

This are the kind of videos I could get from LTX only, sometimes with double faces, twisted heads and all in all milky, blurry.
https://aurelm.com/upload/ComfyUI_01500-audio.mp4
https://aurelm.com/upload/ComfyUI_01501-audio.mp4

Denoising should normally not go above 0.15 otherwise you run into ltx-related issues like blur, distort, artefacts. Also for wan you can set for both samplers the number of steps to 3 for faster iteration.

Sorry for all the unload all models and clearing cache, i chain them and repeat to make sure everything is unloaded to minimize OOM. that I kept getting.

The video was made on a 3090. Around 6 minutes for 6 seconds WAN 720p videos and another 12minutes for each segment upscaling to 2x (1440p aprox).

submitted by /u/aurelm
[link] [comments]

AI Generated Robotic Content

Share
Published by
AI Generated Robotic Content
Tags: ai images

Recent Posts

Nava – A 6.3B audio-video model .

Page: https://ernie-research.github.io/NAVA/ Model: https://huggingface.co/ernie-research/NAVA Github: https://github.com/ernie-research/NAVA NAVA is a 6.3 B-parameter joint audio-video generator that…

1 hour ago

Enterprise Business Software and the Mixed-Up Chameleon Problem

Editor’s Note: This blog post was written by Greg Little, Senior Counselor at Palantir, with…

1 hour ago

High-Throughput Graph Abstraction at Netflix: Part I

By Oleksii Tkachuk, Kartik Sathyanarayanan, Rajiv ShringiIntroductionNetflix has a diverse range of graph use cases, each…

1 hour ago

Comprehensive observability for Amazon SageMaker AI LLM inference: From GPU utilization to LLM quality

Deploying large language models (LLMs) at scale on Amazon SageMaker AI Inference makes observability a…

1 hour ago

Cloud CISO Perspectives: How to build an AI-ready security program for the public sector

Welcome to the second Cloud CISO Perspectives for May 2026. Today, Usman Chaudhary, Field CISO,…

1 hour ago

24 Best Father’s Day Gifts for Dads (2026)

Dads are traditionally tough to shop for—let me help with these handpicked gift ideas for…

2 hours ago