faang

The new AI literacy: Insights from student developers

AI has made it easier than ever for student developers to work efficiently, tackle harder problems, and pursue ambitious projects.…

1 month ago

Exclusive Self Attention

We introduce exclusive self attention (XSA), a simple modification of self attention (SA) that improves the Transformer’s sequence modeling performance. The…

2 months ago

Unlocking video insights at scale with Amazon Bedrock multimodal models

Video content is now everywhere, from security surveillance and media production to social platforms and enterprise communications. However, extracting meaningful…

2 months ago

DRA: A new era of Kubernetes device management with Dynamic Resource Allocation

The explosion of large language models (LLMs) has increased demand for high-performance accelerators like GPUs and TPUs. As organizations scale…

2 months ago

AI-powered robot learns how to harvest tomatoes more efficiently

A new tomato-picking robot is learning to think before it acts. Instead of simply identifying ripe fruit, it predicts how…

2 months ago

Trained on Tokens, Calibrated on Concepts: The Emergence of Semantic Calibration in LLMs

Large Language Models (LLMs) often lack meaningful confidence estimates for their outputs. While base LLMs are known to exhibit next-token…

2 months ago

Manufacturing with the Connected Edge

Industrial and defense environments generate massive amounts of data that can’t wait for the cloud. Latency is often measured in…

2 months ago

Scaling Global Storytelling: Modernizing Localization Analytics at Netflix

Valentin Geffrier, Tanguy Cornuau. Each year, we bring the Analytics Engineering community together for an Analytics Summit, a multi-day internal conference to share…

2 months ago

Deploy SageMaker AI inference endpoints with set GPU capacity using training plans

Deploying large language models (LLMs) for inference requires reliable GPU capacity, especially during critical evaluation periods, limited-duration production testing, or…

2 months ago

Kubernetes as AI Infrastructure: Google Cloud, llm-d, and the CNCF

At Google Cloud, serving the massive-scale needs of large foundation model builders and AI-native companies is at the forefront of…

2 months ago