Evaluate healthcare generative AI applications using LLM-as-a-judge on AWS

1 year ago

In our previous blog posts, we explored various techniques such as fine-tuning large language models (LLMs), prompt engineering, and Retrieval…

Demonstrating the AI-driven telecom at Mobile World Congress

1 year ago

Telecoms, like all businesses, are wondering how AI can transform their businesses. And there’s no better way to display how…

Stories We Can’t Stop Thinking About: Deepfakes, the Tesla Backlash, and All Things Chips

1 year ago

This week on “Uncanny Valley,” our hosts talk about three big stories from February.

How Pattern PXM’s Content Brief is driving conversion on ecommerce marketplaces using AI

1 year ago

Brands today are juggling a million things, and keeping product content up-to-date is at the top of the list. Between…

Rebuilding Alexa: How Amazon is mixing models, agents and browser-use for smarter AI

1 year ago

In rearchitecting the upgraded Alexa voice assistant, Amazon turned to model mixing to bring agentic capabilities to devices.Read More

A Deadly Unidentified Disease Has Emerged in the DRC

1 year ago

More than 50 people have died in the Democratic Republic of the Congo, most within 48 hours of the onset…

A springtail-like jumping robot

1 year ago

Springtails, small bugs often found crawling through leaf litter and garden soil, are expert jumpers. Inspired by these hopping hexapods,…

Self-driving cars learn to share road knowledge through digital word-of-mouth

1 year ago

An NYU Tandon-led research team has developed a way for self-driving vehicles to share their knowledge about road conditions indirectly,…

Start building with Gemini 2.0 Flash and Flash-Lite

1 year ago

Gemini 2.0 Flash-Lite is now generally available in the Gemini API for production use in Google AI Studio and for…

MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs

1 year ago

We introduce MIA-Bench, a new benchmark designed to evaluate multimodal large language models (MLLMs) on their ability to strictly adhere…