AI/ML

Parallel Track Transformers: Enabling Fast GPU Inference with Reduced Synchronization

Efficient large-scale inference of transformer-based large language models (LLMs) remains a fundamental systems challenge, frequently requiring multi-GPU parallelism to meet…

6 days ago

How Amazon uses Amazon Nova models to automate operational readiness testing for new fulfillment centers

Amazon is a global ecommerce and technology company that operates a vast network of fulfillment centers to store, process, and…

6 days ago

Gemini Enterprise Agent Ready (GEAR) program now available, a new path to building AI agents at scale

Today’s reality is agentic – software that can reason, plan, and act on your behalf to execute complex workflows. To…

6 days ago

Automated Reasoning checks rewriting chatbot reference implementation

Today, we are publishing a new open source sample chatbot that shows how to use feedback from Automated Reasoning checks…

1 week ago

How PARTs Assemble into Wholes: Learning the Relative Composition of Images

The composition of objects and their parts, along with object-object positional relationships, provides a rich source of information for representation…

1 week ago

Structured outputs on Amazon Bedrock: Schema-compliant AI responses

Today, we’re announcing structured outputs on Amazon Bedrock—a capability that fundamentally transforms how you can obtain validated JSON responses from…

1 week ago

How we cut Vertex AI latency by 35% with GKE Inference Gateway

As generative AI moves from experimentation to production, platform engineers face a universal challenge for inference serving: you need low…

1 week ago

How Associa transforms document classification with the GenAI IDP Accelerator and Amazon Bedrock

This is a guest post co-written with David Meredith and Josh Zacharias from Associa. Associa, North America’s largest community management…

2 weeks ago

Announcing Claude Opus 4.6 on Vertex AI

At Google Cloud, we’re committed to providing customers with the leading selection of models to build and scale production-ready AI…

2 weeks ago

Accelerating your marketing ideation with generative AI – Part 2: Generate custom marketing images from historical references

Marketing teams face major challenges creating campaigns in today’s digital environment. They must navigate through complex data analytics and rapidly…

2 weeks ago