How PDI built an enterprise-grade RAG system for AI applications with AWS

PDI Technologies is a global leader in the convenience retail and petroleum wholesale industries. They help businesses around the globe…

4 weeks ago

Scaling WideEP Mixture-of-Experts inference with Google Cloud A4X (GB200) and NVIDIA Dynamo

As organizations transition from standard LLMs to massive Mixture-of-Experts (MoE) architectures like DeepSeek-R1, the primary constraint has shifted from raw…

4 weeks ago

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Diffusion large language models (dLLMs) are compelling alternatives to autoregressive (AR) models because their denoising models operate over the entire…

4 weeks ago

How Thomson Reuters built an Agentic Platform Engineering Hub with Amazon Bedrock AgentCore

This post was co-written with Naveen Pollamreddi and Seth Krause from Thomson Reuters. Thomson Reuters (TR) is a leading AI…

4 weeks ago

Introducing multimodal retrieval for Amazon Bedrock Knowledge Bases

We are excited to announce the general availability of multimodal retrieval for Amazon Bedrock Knowledge Bases. This new capability adds…

4 weeks ago

ParaRNN: Unlocking Parallel Training of Nonlinear RNNs for Large Language Models

Recurrent Neural Networks (RNNs) laid the foundation for sequence modeling, but their intrinsic sequential nature restricts parallel computation, creating a…

1 month ago

Advanced fine-tuning techniques for multi-agent orchestration: Patterns from Amazon at scale

Our work with large enterprise customers and Amazon teams has revealed that high-stakes use cases continue to benefit significantly…

1 month ago

Cloud CISO Perspectives: Practical guidance on building with SAIF

Welcome to the first Cloud CISO Perspectives for January 2026. Today, Tom Curry and Anton Chuvakin, from Google Cloud’s Office…

1 month ago

How the Amazon AMET Payments team accelerates test case generation with Strands Agents

At Amazon.ae, we serve approximately 10 million customers monthly across five countries in the Middle East and North Africa region—United…

1 month ago

Introducing BigQuery managed and SQL-native inference for open models

BigQuery provides access to a variety of LLMs for text and embedding generation, including Google's Gemini models, Google-managed models from…

1 month ago