ai/ml

Understanding Calendar mode for Dynamic Workload Scheduler: Reserve ML GPUs and TPUs

Organizations need ML compute resources that can accommodate bursty peaks and periodic troughs. That means the consumption models for AI…

3 weeks ago

Build an intelligent eDiscovery solution using Amazon Bedrock Agents

Legal teams spend bulk of their time manually reviewing documents during eDiscovery. This process involves analyzing electronically stored information across…

3 weeks ago

Your guide to taking an open model from discovery to a production-ready endpoint on Vertex AI

Developers building with gen AI are increasingly drawn to open models for their power and flexibility. But customizing and deploying…

3 weeks ago

MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains

Recent advances in large language models (LLMs) have increased the demand for comprehensive benchmarks to evaluate their capabilities as human-like…

3 weeks ago

Boost cold-start recommendations with vLLM on AWS Trainium

Cold start in recommendation systems goes beyond just new user or new item problems—it’s the complete absence of personalized signals…

3 weeks ago

New Cluster Director features: Simplified GUI, managed Slurm, advanced observability

In April, we released Cluster Director, a unified management plane that makes deploying and managing large-scale AI infrastructure simpler and…

3 weeks ago

Aeneas transforms how historians connect the past

We’re publishing a paper in Nature introducing Aeneas, the first AI model for contextualizing ancient inscriptions.

3 weeks ago

mRAKL: Multilingual Retrieval-Augmented Knowledge Graph Construction for Low-Resourced Languages

Knowledge Graphs represent real-world entities and the relationships between them. Multilingual Knowledge Graph Construction (mKGC) refers to the task of…

3 weeks ago

Customize Amazon Nova in Amazon SageMaker AI using Direct Preference Optimization

At the AWS Summit in New York City, we introduced a comprehensive suite of model customization capabilities for Amazon Nova…

3 weeks ago

Gemini 2.5 Flash-Lite is now ready for scaled production use

Gemini 2.5 Flash-Lite, previously in preview, is now stable and generally available. This cost-efficient model provides high quality in a…

3 weeks ago