ML 18787 image001

Use custom metrics to evaluate your generative AI application with Amazon Bedrock

With Amazon Bedrock Evaluations, you can evaluate foundation models (FMs) and Retrieval Augmented Generation (RAG) systems, whether hosted on Amazon Bedrock or another model or RAG system hosted elsewhere, including Amazon Bedrock Knowledge Bases or multi-cloud and on-premises deployments. We recently announced the general availability of the large language model (LLM)-as-a-judge technique in model evaluation …

image1 LTlj222max 1000x1000 1

Build live voice-driven agentic applications with Vertex AI Gemini Live API

Across industries, enterprises need efficient and proactive solutions. Imagine frontline professionals using voice commands and visual input to diagnose issues, access vital information, and initiate processes in real-time. The Gemini 2.0 Flash Live API empowers developers to create next-generation, agentic industry applications. This API extends these capabilities to complex industrial operations. Unlike solutions relying on …

1awfesgrbdmax 1000x1000 1

Pushing the limits of electric mobility: Formula E’s Mountain Recharge

When’s the last time you watched a race for the braking? It’s the heart-pounding acceleration and death-defying maneuvers that keep most motorsport fans on the edge of their seats. Especially when it comes to Formula E — and really all EVs — the explosive, near-instantaneous acceleration of an electric motor is part of the appeal. …

image001 1

Build a gen AI–powered financial assistant with Amazon Bedrock multi-agent collaboration

The Amazon Bedrock multi-agent collaboration feature gives developers the flexibility to create and coordinate multiple AI agents, each specialized for specific tasks, to work together efficiently on complex business processes. This enables seamless handling of sophisticated workflows through agent cooperation. This post aims to demonstrate the application of multiple specialized agents within the Amazon Bedrock …

image1 GrApv68max 1000x1000 1

Create chatbots that speak different languages with Gemini, Gemma, Translation LLM, and Model Context Protocol

Your customers might not all speak the same language. If you operate internationally or serve a diverse customer base, you need your chatbot to meet them where they are – whether they’re searching for something in Spanish or Japanese. If you want to give your customers multilingual support with chatbots, you’ll need to orchestrate multiple …

Local Pan-Privacy for Federated Analytics

Pan-privacy was proposed by Dwork et al. (2010) as an approach to designing a private analytics system that retains its privacy properties in the face of intrusions that expose the system’s internal state. Motivated by federated telemetry applications, we study local pan-privacy, where privacy should be retained under repeated unannounced intrusions on the local state. …

ml 18652 image 1

Best practices for Meta Llama 3.2 multimodal fine-tuning on Amazon Bedrock

Multimodal fine-tuning represents a powerful approach for customizing foundation models (FMs) to excel at specific tasks that involve both visual and textual information. Although base multimodal models offer impressive general capabilities, they often fall short when faced with specialized visual tasks, domain-specific content, or particular output formatting requirements. Fine-tuning addresses these limitations by adapting models …

1xxVKEve3C0kvgsPmq8tmMg

Data Streaming: Real-time data for real-time decisions

Data Streaming: Real-time data for real-time decisions (Palantir RFx Blog Series, #8) Often the most business critical decisions are also the most time-sensitive. Data streaming technologies let organizations act on information (almost) as quickly as it comes in. Editor’s note: This is the eighth post in the Palantir RFx Blog Series, which explores how organizations can better …

ML 18742 image 1

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

Amazon Bedrock Model Distillation is generally available, and it addresses the fundamental challenge many organizations face when deploying generative AI: how to maintain high performance while reducing costs and latency. This technique transfers knowledge from larger, more capable foundation models (FMs) that act as teachers to smaller, more efficient models (students), creating specialized models that …

image001 3

Responsible AI in action: How Data Reply red teaming supports generative AI safety on AWS

Generative AI is rapidly reshaping industries worldwide, empowering businesses to deliver exceptional customer experiences, streamline processes, and push innovation at an unprecedented scale. However, amidst the excitement, critical questions around the responsible use and implementation of such powerful technology have started to emerge. Although responsible AI has been a key focus for the industry over …