GPU blog gif 2

Run your AI inference applications on Cloud Run with NVIDIA GPUs

Developers love Cloud Run for its simplicity, fast autoscaling, scale-to-zero capabilities, and pay-per-use pricing. Those same benefits come into play for real-time inference apps serving open gen AI models. That’s why today, we’re adding support for NVIDIA L4 GPUs to Cloud Run, in preview. This opens the door to many new use cases to Cloud …

Lightweight Champ: NVIDIA Releases Small Language Model With State-of-the-Art Accuracy

Developers of generative AI typically face a tradeoff between model size and accuracy. But a new language model released by NVIDIA delivers the best of both, providing state-of-the-art accuracy in a compact form factor. Mistral-NeMo-Minitron 8B — a miniaturized version of the open Mistral NeMo 12B model released by Mistral AI and NVIDIA last month …

Researchers train a robot dog to combat invasive fire ants

A multidisciplinary research team based across China and Brazil has used a dog-like robot and AI to create a new way to find fire ant nests. Published in the journal Pest Management Science, the study highlights how a “CyberDog” robot integrated with an AI model can automate the identification and control of Red Imported Fire …

3 Ways of Using Gemma 2 Locally

After the highly successful launch of Gemma 1, the Google team introduced an even more advanced model series called Gemma 2. This new family of Large Language Models (LLMs) includes models with 9 billion (9B) and 27 billion (27B) parameters. Gemma 2 offers higher performance and greater inference efficiency than its predecessor, with significant safety …

ml 17291 image001

Migrate Amazon SageMaker Data Wrangler flows to Amazon SageMaker Canvas for faster data preparation

Amazon SageMaker Data Wrangler provides a visual interface to streamline and accelerate data preparation for machine learning (ML), which is often the most time-consuming and tedious task in ML projects. Amazon SageMaker Canvas is a low-code no-code visual interface to build and deploy ML models without the need to write code. Based on customers’ feedback, …

AI assistant monitors teamwork to promote effective collaboration

On a research cruise around Hawaii in 2018, Yuening Zhang SM ’19, Ph.D. ’24 saw how difficult it was to keep a tight ship. The careful coordination required to map underwater terrain could sometimes lead to a stressful environment for team members, who might have different understandings of which tasks must be completed in spontaneously …