Recordings of business meetings, interviews, and customer interactions have become essential for preserving important information. However, transcribing and summarizing these…
The pace of innovation in open-source AI is breathtaking, with models like Meta's Llama 4 and DeepSeek AI's DeepSeek. However, deploying…
This post is co-written with Qing Chen and Mark Sinclair from Radial. Radial is the largest 3PL fulfillment provider, also…
Today, we are excited to announce that Gartner® has named Google as a Leader in the 2025 Magic Quadrant™ for…
Vision foundation models pre-trained on massive data encode rich representations of real-world concepts, which can be adapted to downstream tasks…
This post is co-written with Tatia Tsmindashvili, Ana Kolkhidashvili, Guram Dentoshvili, and Dachi Choladze from Impel. Impel transforms automotive retail through…
Last month at Google Cloud Next ’25, we announced MCP Toolbox for Databases to make it easier to connect generative…
Gemini 2.5 has new capabilities in AI-powered audio dialog and generation.
Cross-lingual transfer is a popular approach to increase the amount of training data for NLP tasks in a low-resource context.…
We’ve witnessed remarkable advances in model capabilities as generative AI companies have invested in developing their offerings. Language models such…