Everyone can now fly their own drone.

TL;DR Using Google’s new Veo 3.1 video model, we created a breathtaking 1 minute 40 second FPV drone flight through mountain valleys, and it took just 15 minutes to generate. Imagine soaring through alpine valleys, gliding between snowy peaks, and diving toward rivers that twist like silver ribbons below, all without leaving your desk. That’s …

CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching

Conditional generative modeling aims to learn a conditional data distribution from samples containing data-condition pairs. For this, diffusion and flow-based methods have attained compelling results. These methods use a learned (flow) model to transport an initial standard Gaussian noise that ignores the condition to the conditional data distribution. The model is hence required to learn …

1 hzHgTuYmax 1000x1000 1

Announcing BigQuery-managed AI functions for better SQL

For decades, SQL has been the universal language for data analysis, offering access to analytics on structured data. Large Language Models (LLMs) like Gemini now provide a path to get nuanced insights from unstructured data such as text, image and video. However, integrating LLMs into standard SQL flow requires data movement, at least some prompt …

Weibo’s new open source AI model VibeThinker-1.5B outperforms DeepSeek-R1 on $7,800 post-training budget

Another day in late 2025, another impressive result from a Chinese company in open source artificial intelligence. Chinese social networking company Weibo’s AI division recently released its open source VibeThinker-1.5B—a 1.5 billion parameter large language model (LLM) that is a fine-tuned variant of rival Chinese tech firm Alibaba’s Qwen2.5-Math-1.5B. It’s available now for free download …

When AI draws our words: Study finds image generators fail basic instructions despite aesthetic success

Can we really trust artificial intelligence to illustrate our ideas? A team of scientists has examined the capabilities of Midjourney and DALL·E—two Generative Artificial Intelligence (GAI) software programs—to produce images from simple sentences. The verdict is mixed… between aesthetic feats and beginner’s mistakes, machines still have a long way to go.

This Is a Weapon of Choice (Wan2.2 Animate)

I used a workflow from here: https://github.com/IAMCCS/comfyui-iamccs-workflows/tree/main Specifically this one: https://github.com/IAMCCS/comfyui-iamccs-workflows/blob/main/C_IAMCCS_NATIVE_WANANIMATE_LONG_VIDEO_v.1.json submitted by /u/sutrik [link] [comments]

ML 19929 1

Introducing agent-to-agent protocol support in Amazon Bedrock AgentCore Runtime

We recently announced the support for Agent-to-Agent (A2A) protocol on Amazon Bedrock AgentCore Runtime. With this addition, agents can discover peers, share capabilities, and coordinate actions across platforms using standardized communication. Amazon Bedrock AgentCore Runtime provides a secure, serverless environment designed for deploying AI agents and tools. It works with any framework and model, supports …

BigQuery under the hood: How Google brought embeddings to analytics

Embeddings are a crucial component at the intersection of data and AI. As data structures, they encode the inherent meaning of the data they represent, and their significance becomes apparent when they are compared to one another. Vector search is a technique that uncovers the relative meaning of those embeddings by evaluating the distances between …