Categories: AI/ML Research

Statistical Methods for Evaluating LLM Performance

The large language model (LLM) has become a cornerstone of many AI applications.
AI Generated Robotic Content

Recent Posts

Getting started with computer use in Amazon Bedrock Agents

Computer use is a breakthrough capability from Anthropic that allows foundation models (FMs) to visually…

4 hours ago

OpenAI’s strategic gambit: The Agents SDK and why it changes everything for enterprise AI

OpenAI's new API and Agents SDK consolidate a previously fragmented complex ecosystem into a unified,…

5 hours ago

Under Trump, AI Scientists Are Told to Remove ‘Ideological Bias’ From Powerful Models

A directive from the National Institute of Standards and Technology eliminates mention of “AI safety”…

5 hours ago

Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models

Speech foundation models, such as HuBERT and its variants, are pre-trained on large amounts of…

1 day ago

How GoDaddy built a category generation system at scale with batch inference for Amazon Bedrock

This post was co-written with Vishal Singh, Data Engineering Leader at Data & Analytics team…

1 day ago

10 months to innovation: Definity’s leap to data agility with BigQuery and Vertex AI

At Definity, a leading Canadian P&C insurer with a history spanning over 150 years, we…

1 day ago