Gemini 2.0 Flash-Lite is now generally available in the Gemini API for production use in Google AI Studio and for…
We introduce MIA-Bench, a new benchmark designed to evaluate multimodal large language models (MLLMs) on their ability to strictly adhere…
Promoting Army readiness through seamless coordination between Palantir-powered Army Vantage platform and Microsoft Power BIBetter TogetherAs the Department of Defense (DoD)…
This post is co-written with Xavier Vizcaino, Diego Martín Montoro, and Jordi Sánchez Ferrer from Applus+ Idiada. In 2021, Applus+…
Many specialized vector databases today require you to create complex pipelines and applications in order to get the data you…
Today, we’re excited to announce that Mistral-Small-24B-Instruct-2501—a twenty-four billion parameter large language model (LLM) from Mistral AI that’s optimized for low…
Today, we’re announcing Claude 3.7 Sonnet, Anthropic’s most intelligent model to date and the first hybrid reasoning model on the…
The End of the AI Safety DebateFor years, a passionate contingent of researchers, ethicists, and policymakers warned about the potential…
TL;DR We compared Grok 3 and o3-mini’s results on this topic. They both passed. Since Grok 3 was released we…
This post was written with Dian Xu and Joel Hawkins of Rocket Companies. Rocket Companies is a Detroit-based FinTech company…