Announcing Anthropic’s upgraded Claude 3.5 Sonnet on Vertex AI

At Google Cloud, we’ve taken an open approach in building our Vertex AI platform — to provide the most powerful AI tools available along with unparalleled choice and flexibility. That’s why Vertex AI delivers access to over 160 models — including first-party, open-source, and third-party models — so you can build solutions specifically tailored to …

Efficient Source-Free Time-Series Adaptation via Parameter Subspace Disentanglement

The growing demand for personalized and private on-device applications highlights the importance of source-free unsupervised domain adaptation (SFDA) methods, especially for time-series data, where individual differences produce large domain shifts. As sensor-embedded mobile devices become ubiquitous, optimizing SFDA methods for parameter utilization and data-sample efficiency in time-series contexts becomes crucial. Personalization in time series is …

12AC9E 4zECwvZxb5QkWRIxIQ

From Prototype to Production

From Prototype to Production (Engineering Responsible AI, #3) Testing and Evaluating AI Systems with AIP Evals Editor’s Note: This is the third post in a series on responsible AI. Proof Over Promises: The Challenge of Making AI Work While it’s relatively easy to create a prototype AI system using Generative AI, addressing your organization’s most pressing challenges with …

ML 17694 image001

Amazon Bedrock Custom Model Import now generally available

Today, we’re pleased to announce the general availability (GA) of Amazon Bedrock Custom Model Import. This feature empowers customers to import and use their customized models alongside existing foundation models (FMs) through a single, unified API. Whether leveraging fine-tuned models like Meta Llama, Mistral Mixtral, and IBM Granite, or developing proprietary models based on popular …

4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities

*Equal Contributors Current multimodal and multitask foundation models like 4M or UnifiedIO show promising results, but in practice their out-of-the-box abilities to accept diverse inputs and perform diverse tasks are limited by the (usually rather small) number of modalities and tasks they are trained on. In this paper, we significantly expand upon the capabilities of …

12ATm85xrX2S5AfhnXQxqXEEw

Turning Conversation Into Action

Turning Conversation Into Action (Palantir CSE #2) Anchoring AI Agents Into the Enterprise Editor’s Note: This is the second in a three-part blog series about Palantir’s AI-enabled Customer Service Engine. Part 2: Implementation In Part 1 of this three-part blog series, we explored the agentic architecture of the Customer Service Engine (CSE) through the lens of a …

ML 17117 image001

Train, optimize, and deploy models on edge devices using Amazon SageMaker and Qualcomm AI Hub

This post is co-written Rodrigo Amaral, Ashwin Murthy and Meghan Stronach from Qualcomm. In this post, we introduce an innovative solution for end-to-end model customization and deployment at the edge using Amazon SageMaker and Qualcomm AI Hub. This seamless cloud-to-edge AI development experience will enable developers to create optimized, highly performant, and custom managed machine …

image001 2

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

With the advent of generative AI and machine learning, new opportunities for enhancement became available for different industries and processes. During re:Invent 2023, we launched AWS HealthScribe, a HIPAA eligible service that empowers healthcare software vendors to build their clinical applications to use speech recognition and generative AI to automatically create preliminary clinician documentation. In …

Scalable Private Search with Wally

This paper presents Wally, a private search system that supports efficient semantic and keyword search queries against large databases. When sufficiently many clients are making queries, Wally’s performance is significantly better than previous systems. In previous private search systems, for each client query, the server must perform at least one expensive cryptographic operation per database …

dpg architecture

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

This post was co-written with Lucas Desard, Tom Lauwers, and Sam Landuydt from DPG Media. DPG Media is a leading media company in Benelux operating multiple online platforms and TV channels. DPG Media’s VTM GO platform alone offers over 500 days of non-stop content. With a growing library of long-form video content, DPG Media recognizes …