Vertex AI Latency Comparisonpng5wmax 1000x1000 1

How we cut Vertex AI latency by 35% with GKE Inference Gateway

As generative AI moves from experimentation to production, platform engineers face a universal challenge for inference serving: you need low latency, high throughput, and manageable costs.  It is a difficult balance. Traffic patterns vary wildly, from complex coding tasks that require processing huge amounts of data, to quick, chatty conversations that demand instant replies. Standard …

image 1 13

How Associa transforms document classification with the GenAI IDP Accelerator and Amazon Bedrock

This is a guest post co-written with David Meredith and Josh Zacharias from Associa. Associa, North America’s largest community management company, oversees approximately 7.5 million homeowners with 15,000 employees across more than 300 branch offices. The company manages approximately 48 million documents across 26 TB of data, but their existing document management system lacks efficient …

shopify 5g2sFtzmax 1000x1000 1

Announcing Claude Opus 4.6 on Vertex AI

At Google Cloud, we’re committed to providing customers with the leading selection of models to build and scale production-ready AI apps and agents on a platform optimized for performance, trust, and global scale. Today, we’re further expanding Vertex AI’s curated collection of models with the addition of Anthropic’s newest release: Claude Opus 4.6. Claude Opus …

ML 20209 image 1

Accelerating your marketing ideation with generative AI – Part 2: Generate custom marketing images from historical references

Marketing teams face major challenges creating campaigns in today’s digital environment. They must navigate through complex data analytics and rapidly changing consumer preferences to produce engaging, personalized content across multiple channels while maintaining brand consistency and working within tight deadlines. Using generative AI can streamline and accelerate the creative process while maintaining alignment with business …

A Reinforcement Learning Based Universal Sequence Design for Polar Codes

To advance Polar code design for 6G applications, we develop a reinforcement learning-based universal sequence design framework that is extensible and adaptable to diverse channel conditions and decoding strategies. Crucially, our method scales to code lengths up to 2048, making it suitable for use in standardization. Across all (N,K)(N, K)(N,K) configurations supported in 5G, our …

ML 20204 image 1

Democratizing business intelligence: BGL’s journey with Claude Agent SDK and Amazon Bedrock AgentCore

This post is cowritten with James Luo from BGL. Data analysis is emerging as a high-impact use case for AI agents. According to Anthropic’s 2026 State of AI Agents Report, 60% of organizations rank data analysis and report generation as their most impactful agentic AI applications. 65% of enterprises cite it as a top priority. …

ml 193561

How Clarus Care uses Amazon Bedrock to deliver conversational contact center interactions

This post was cowritten by Rishi Srivastava and Scott Reynolds from Clarus Care. Many healthcare practices today struggle with managing high volumes of patient calls efficiently. From appointment scheduling and prescription refills to billing inquiries and urgent medical concerns, practices face the challenge of providing timely responses while maintaining quality patient care. Traditional phone systems …

1 tjBW7LCmax 1000x1000 1

Build intelligent employee onboarding with Gemini Enterprise

Employee onboarding is rarely a linear process. It’s a complex web of dependencies that vary significantly based on an individual’s specific profile. For example, even a simple request for a laptop requires the system to cross-reference the employee’s role, function, and seniority level to determine whether they need a high-powered workstation or a standard mobile …

Self-Supervised Learning with Gaussian Processes

Self supervised learning (SSL) is a machine learning paradigm where models learn to understand the underlying structure of data without explicit supervision from labeled samples. The acquired representations from SSL have demonstrated useful for many downstream tasks including clustering, and linear classification, etc. To ensure smoothness of the representation space, most SSL methods rely on …

15dQn8UoWO0qWm X7gg8qVQ

How Palantir AIP Accelerates Data Migration

The Octopus Model for Enterprise Transformation Enterprise data migration can be among the most costly, complex, and time-consuming endeavors organizations undertake — but it doesn’t have to be. Traditional migrations require coordinating consultants alongside separated internal business and technology teams to unlock the potential of data stored in brittle ERP system, customized SAP legacy instances, or even …