image1 ur0KlJxmax 1000x1000 1

A new top score: Advancing Text-to-SQL on the BIRD benchmark

In the fast-evolving world of agentic development, natural language is becoming the standard for interaction. This shift is deeply connected to the power of operational databases, where a more accurate text-to-SQL capability is a major catalyst for building better, more capable agents. From empowering non-technical users to self-serve data, to accelerating analyst productivity, the ability …

Expanding support for AI developers on Hugging Face

For those building with AI, most are in it to change the world — not twiddle their thumbs. So when inspiration strikes, the last thing anyone wants is to spend hours waiting for the latest AI models to download to their development environment. That’s why today we’re announcing a deeper partnership between Hugging Face and …

Everyone can now fly their own drone.

TL;DR Using Google’s new Veo 3.1 video model, we created a breathtaking 1 minute 40 second FPV drone flight through mountain valleys, and it took just 15 minutes to generate. Imagine soaring through alpine valleys, gliding between snowy peaks, and diving toward rivers that twist like silver ribbons below, all without leaving your desk. That’s …

CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching

Conditional generative modeling aims to learn a conditional data distribution from samples containing data-condition pairs. For this, diffusion and flow-based methods have attained compelling results. These methods use a learned (flow) model to transport an initial standard Gaussian noise that ignores the condition to the conditional data distribution. The model is hence required to learn …

1 hzHgTuYmax 1000x1000 1

Announcing BigQuery-managed AI functions for better SQL

For decades, SQL has been the universal language for data analysis, offering access to analytics on structured data. Large Language Models (LLMs) like Gemini now provide a path to get nuanced insights from unstructured data such as text, image and video. However, integrating LLMs into standard SQL flow requires data movement, at least some prompt …

ML 19929 1

Introducing agent-to-agent protocol support in Amazon Bedrock AgentCore Runtime

We recently announced the support for Agent-to-Agent (A2A) protocol on Amazon Bedrock AgentCore Runtime. With this addition, agents can discover peers, share capabilities, and coordinate actions across platforms using standardized communication. Amazon Bedrock AgentCore Runtime provides a secure, serverless environment designed for deploying AI agents and tools. It works with any framework and model, supports …

BigQuery under the hood: How Google brought embeddings to analytics

Embeddings are a crucial component at the intersection of data and AI. As data structures, they encode the inherent meaning of the data they represent, and their significance becomes apparent when they are compared to one another. Vector search is a technique that uncovers the relative meaning of those embeddings by evaluating the distances between …

image 1 17

Fine-tune VLMs for multipage document-to-JSON with SageMaker AI and SWIFT

Extracting structured data from documents like invoices, receipts, and forms is a persistent business challenge. Variations in format, layout, language, and vendor make standardization difficult, and manual data entry is slow, error-prone, and unscalable. Traditional optical character recognition (OCR) and rule-based systems often fall short in handling this complexity. For instance, a regional bank might …

image1 HnbQkXWmax 1000x1000 1

Running high-scale reinforcement learning (RL) for LLMs on GKE

As Large Language Models (LLMs) evolve, Reinforcement Learning (RL) is becoming the crucial technique for aligning powerful models with human preferences and complex task objectives. However, enterprises that need to implement and scale RL for LLMs are facing infrastructure challenges. The primary hurdles include the memory contention from concurrently hosting multiple large models (such as …

ExpertLens: Activation Steering Features Are Highly Interpretable

This paper was accepted at the Workshop on Unifying Representations in Neural Models (UniReps) at NeurIPS 2025. Activation steering methods in large language models (LLMs) have emerged as an effective way to perform targeted updates to enhance generated language without requiring large amounts of adaptation data. We ask whether the features discovered by activation steering …