This post summarizes a very important livestream with a WAN engineer. The new model will be at least partially open (model architecture, training code, and inference code). It may even get fully open weights if the community treats them with respect and gratitude. One of their engineers basically spelled this out on Twitter a few days ago: he asked us to voice our interest in an open model in a calm and respectful way, because any hostility makes it less likely that the company releases it openly.
Training this kind of model costs millions of dollars. Everyone, be on your best behavior. We’re all excited and hoping for the best! I’m already grateful that we’ve been blessed with WAN 2.2, which is amazing as it is.
PS: The new 1080p/10-second mode will probably be far out of reach for consumer hardware, but the architecture improvements at 480p/720p are exciting enough already. It creates such beautiful videos and really good audio tracks. It would be a dream to see a public release, even if we have to quantize it heavily to fit the model onto our consumer GPUs. 😅
submitted by /u/pilkyton