faang

Personalized Group Relative Policy Optimization for Heterogenous Preference Alignment

Despite their sophisticated general-purpose capabilities, Large Language Models (LLMs) often fail to align with diverse individual preferences because standard post-training…

2 weeks ago

Smarter Live Streaming at Scale: Rolling Out VBR for All Netflix Live Events

By Renata Teixeira, Zhi Li, Reenal Mahajan, and Wei WeiOn January 26, 2026, we flipped an important switch for Live at…

2 weeks ago

Simulate realistic users to evaluate multi-turn AI agents in Strands Evals

Evaluating single-turn agent interactions follows a pattern that most teams understand well. You provide an input, collect the output, and…

2 weeks ago

How Honeylove boosts product quality and service efficiency with BigQuery

Building the perfect bra takes thousands of data points. That’s why Honeylove isn’t just another intimates brand. We’re a technology…

2 weeks ago

Automating competitive price intelligence with Amazon Nova Act

Monitoring competitor prices is essential for ecommerce teams to maintain a market edge. However, many teams remain trapped in manual…

3 weeks ago

Run real-time and async inference on the same infrastructure with GKE Inference Gateway

As AI workloads transition from experimental prototypes to production-grade services, the infrastructure supporting them faces a growing utilization gap. Enterprises…

3 weeks ago

ProText: A Benchmark Dataset for Measuring (Mis)gendering in Long-Form Texts

We introduce ProText, a dataset for measuring gendering and misgendering in stylistically diverse long-form English texts. ProText spans three dimensions:…

3 weeks ago

Build reliable AI agents with Amazon Bedrock AgentCore Evaluations

Your AI agent worked in the demo, impressed stakeholders, handled test scenarios, and seemed ready for production. Then you deployed…

3 weeks ago

Entropy-Preserving Reinforcement Learning

Policy gradient algorithms have driven many recent advancements in language model reasoning. An appealing property is their ability to learn…

3 weeks ago

How Ring scales global customer support with Amazon Bedrock Knowledge Bases

This post is cowritten with David Kim, and Premjit Singh from Ring. Scaling self-service support globally presents challenges beyond translation.…

3 weeks ago