ml 17926 meta sam 1 2

Meta SAM 2.1 is now available in Amazon SageMaker JumpStart

This blog post is co-written with George Orlin from Meta. Today, we are excited to announce that Meta’s Segment Anything Model (SAM) 2.1 vision segmentation model is publicly available through Amazon SageMaker JumpStart to deploy and run inference. Meta SAM 2.1 provides state-of-the-art video and image segmentation capabilities in a single model. This cutting-edge model …

Theory, Analysis, and Best Practices for Sigmoid Self-Attention

*Primary Contributors Attention is a key part of the transformer architecture. It is a sequence-to-sequence mapping that transforms each sequence element into a weighted sum of values. The weights are typically obtained as the softmax of dot products between keys and queries. Recent work has explored alternatives to softmax attention in transformers, such as ReLU …

Picture1 6

Transforming credit decisions using generative AI with Rich Data Co and AWS

This post is co-written with Gordon Campbell, Charles Guan, and Hendra Suryanto from RDC.  The mission of Rich Data Co (RDC) is to broaden access to sustainable credit globally. Its software-as-a-service (SaaS) solution empowers leading banks and lenders with deep customer insights and AI-driven decision-making capabilities. Making credit decisions using AI can be challenging, requiring …

1 neflowrdma.max 1000x1000 1

Networking support for AI workloads

At Google Cloud, we strive to make it easy to deploy AI models onto our infrastructure. In this blog we explore how the Cross-Cloud Network solution supports your AI workloads. Managed and Unmanaged AI options Google Cloud provides both managed (Vertex AI) and do-it-yourself (DIY) approaches for running AI workloads.  Vertex AI: A fully managed …