Categories: Image

WAN2.5-Preview: They are collecting feedback to fine-tune this PREVIEW. The full release will have open training + inference code. The weights MAY be released, but not decided yet. WAN2.5 demands SIGNIFICANTLY more VRAM due to being 1080p and 10 seconds. Final system requirements unknown! (@50:57)

This post summarizes a very important livestream with a WAN engineer. It will at least be partially open (model architecture, training code and inference code). Maybe even fully open weights if the community treats them with respect and gratitude, which is also what one of their engineers basically spelled out on Twitter a few days ago, where he asked us to voice our interest in an open model but in a calm and respectful way, because any hostility makes it less likely that the company releases it openly.

The cost to train this kind of model is millions of dollars. Everyone be on your best behaviors. We’re all excited and hoping for the best! I’m already grateful that we’ve been blessed with WAN 2.2 which is already amazing.

PS: The new 1080p/10 seconds mode will probably be far outside consumer hardware reach, but the improvements in the architecture at 480/720p are exciting enough already. It creates such beautiful videos and really good audio tracks. It would be a dream to see a public release, even if we have to quantize it heavily to fit all that data into our consumer GPUs. 😅

submitted by /u/pilkyton
[link] [comments]

AI Generated Robotic Content

Share
Published by
AI Generated Robotic Content
Tags: ai images

Recent Posts

Workflow upscale/magnify video from Sora with Wan , based on cseti007

📦 : https://github.com/lovisdotio/workflow-magnify-upscale-video-comfyui-lovis I did this ComfyUI workflow for Sora 2 upscaling 🚀 ( or…

14 hours ago

The Complete Guide to Pydantic for Python Developers

Python's flexibility with data types is convenient when coding, but it can lead to runtime…

14 hours ago

Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping

We revisit scene-level 3D object detection as the output of an object-centric framework capable of…

14 hours ago

Inside the AIPCon 8 Demos Transforming Manufacturing, Insurance, and Construction

Editor’s Note: This is the second in a two-part series highlighting demo sessions from AIPCon…

14 hours ago

Responsible AI design in healthcare and life sciences

Generative AI has emerged as a transformative technology in healthcare, driving digital transformation in essential…

14 hours ago

5 ad agencies used Gemini 2.5 Pro and gen media models to create an “impossible ad”

The conversation around generative AI in the enterprise is getting creative.  Since launching our popular…

14 hours ago