This post summarizes a very important livestream with a WAN engineer. The upcoming model will be at least partially open (model architecture, training code, and inference code), and maybe even fully open-weights if the community treats them with respect and gratitude. That's essentially what one of their engineers spelled out on Twitter a few days ago: he asked us to voice our interest in an open model, but calmly and respectfully, because any hostility makes it less likely that the company releases it openly.
The cost to train this kind of model runs into the millions of dollars, so everyone, be on your best behavior. We're all excited and hoping for the best! I'm already grateful that we've been blessed with WAN 2.2, which is amazing in its own right.
PS: The new 1080p/10-second mode will probably be far out of reach for consumer hardware, but the architectural improvements at 480p/720p are exciting enough on their own. It produces such beautiful videos and really good audio tracks. It would be a dream to see a public release, even if we have to quantize it heavily to fit all that data onto our consumer GPUs. 😅
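To put the "quantize it heavily" part in perspective, here's a minimal back-of-the-envelope sketch of how weight memory scales with quantization bit-width. The 27B parameter count is purely a placeholder for illustration, not an official WAN spec, and this only counts the weights themselves (activations, text encoder, and VAE add more on top):

```python
# Rough VRAM estimate for model weights alone under different quantization levels.
# The parameter count is a placeholder, NOT an official figure for any WAN model.

def weight_vram_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate memory needed to hold the weights, in gigabytes."""
    return num_params * bits_per_param / 8 / 1e9

hypothetical_params = 27e9  # placeholder; adjust once real specs are published

for label, bits in [("FP16", 16), ("FP8", 8), ("4-bit (e.g. GGUF Q4)", 4)]:
    print(f"{label:>22}: ~{weight_vram_gb(hypothetical_params, bits):.0f} GB")
```

At those assumed numbers, 4-bit quantization brings the weights from roughly 54 GB down to about 14 GB, which is the kind of squeeze that makes a 24 GB consumer card at least conceivable.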
submitted by /u/pilkyton