Adapting Self-Supervised Representations as a Latent Space for Efficient Generation

We introduce Representation Tokenizer (RepTok), a generative modeling framework that represents an image using a single continuous latent token obtained from self-supervised vision transformers. Building on a pre-trained SSL encoder, we fine-tune only the semantic token embedding and pair it with a generative decoder trained jointly using a standard flow matching objective. This adaptation enriches …

10I8DAvXCQEN1RpTTC0ZyaQ

Supercharging the ML and AI Development Experience at Netflix

Supercharging the ML and AI Development Experience at Netflix with Metaflow Shashank Srikanth, Romain Cledat Metaflow — a framework we started and open-sourced in 2019 — now powers a wide range of ML and AI systems across Netflix and at many other companies. It is well loved by users for helping them take their ML/AI workflows from prototype to production, allowing …

Screenshot 2025 11 04 at 101509AM

Iterate faster with Amazon Bedrock AgentCore Runtime direct code deployment

Amazon Bedrock AgentCore is an agentic platform for building, deploying, and operating effective agents securely at scale. Amazon Bedrock AgentCore Runtime is a fully managed service of Bedrock AgentCore, which provides low latency serverless environments to deploy agents and tools. It provides session isolation, supports multiple agent frameworks including popular open-source frameworks, and handles multimodal …

Databricks research reveals that building better AI judges isn’t just a technical concern, it’s a people problem

The intelligence of AI models isn’t what’s blocking enterprise deployments. It’s the inability to define and measure quality in the first place. That’s where AI judges are now playing an increasingly important role. In AI evaluation, a “judge” is an AI system that scores outputs from another AI system.  Judge Builder is Databricks’ framework for …

Policy Maps: Tools for Guiding the Unbounded Space of LLM Behaviors

AI policy sets boundaries on acceptable behavior for AI models, but this is challenging in the context of large language models (LLMs): how do you ensure coverage over a vast behavior space? We introduce policy maps, an approach to AI policy design inspired by the practice of physical mapmaking. Instead of aiming for full coverage, …

Screenshot 2025 10 29 at 90917AM

How Switchboard, MD automates real-time call transcription in clinical contact centers with Amazon Nova Sonic

In high-volume healthcare contact centers, every patient conversation carries both clinical and operational significance, making accurate real-time transcription necessary for automated workflows. Accurate, instant transcription enables intelligent automation without sacrificing clarity or care, so that teams can automate electronic medical record (EMR) record matching, streamline workflows, and eliminate manual data entry. By removing routine process …