By Jove, It’s No Myth: NVIDIA Triton Speeds Inference on Oracle Cloud

An avid cyclist, Thomas Park knows the value of having lots of gears to maintain a smooth, fast ride. So, when the software architect designed an AI inference platform to serve predictions for Oracle Cloud Infrastructure’s (OCI) Vision AI service, he picked NVIDIA Triton Inference Server. That’s because it can shift up, down or sideways …

An Interview with Gemini

Hey, how are you? Should I call you Bard or Gemini? Hey there! I’m doing great, thanks for asking. You can call me whichever you prefer! Both Bard and Gemini are names for me. Bard reflects my storytelling and conversational abilities, while Gemini represents my latest upgrade with improved reasoning and factual accuracy. So, whether …

Coral Reef Halos 672x357 1

Creature Feature: Safari Across 5 Animal-Focused AI Initiatives of 2023

Whether abundant, endangered or extinct, animal species are the focus of countless AI-powered conservation projects. These initiatives — accelerated using NVIDIA GPUs, deep learning software and robotics technology — are alerting conservationists to poaching threats, powering more sustainable aquaculture and helping scientists monitor coral reef health. Take a safari through the NVIDIA Blog’s top animal …

Why AI may not speak freely.

Why are AIs (LLMs, i.e. Large Language Models) not allowed to speak freely? The limitations on the speech of Large Language Models (LLMs) like the one you’re interacting with are primarily due to a combination of ethical, legal, and practical considerations. Here are the main reasons for these restrictions: Ethical Considerations: AI systems must be …

Tune In to the Top 5 NVIDIA Videos of 2023

2023 was marked by the generative AI boom, representing a new era for how artificial intelligence can be used across industries. The year’s top videos from the NVIDIA YouTube channel reflect this focus, with popular videos highlighting the technology powering large language models, new platforms for building generative AI applications and how accelerated computing and …

year in review hero 1

2023: A year of groundbreaking advances in AI and computing

Posted by Jeff Dean, Chief Scientist, Google DeepMind & Google Research, Demis Hassabis, CEO, Google DeepMind, and James Manyika, SVP, Google Research, Technology & Society This has been a year of incredible progress in the field of Artificial Intelligence (AI) research and its practical applications. As ongoing research pushes AI even farther, we look back …

year in review hero

2023: A year of groundbreaking advances in AI and computing

Posted by Jeff Dean, Chief Scientist, Google DeepMind & Google Research, Demis Hassabis, CEO, Google DeepMind, and James Manyika, SVP, Google Research, Technology & Society This has been a year of incredible progress in the field of Artificial Intelligence (AI) research and its practical applications. As ongoing research pushes AI even farther, we look back …

Robert Van Dusen

Amazon SageMaker model parallel library now accelerates PyTorch FSDP workloads by up to 20%

Large language model (LLM) training has surged in popularity over the last year with the release of several popular models such as Llama 2, Falcon, and Mistral. Customers are now pre-training and fine-tuning LLMs ranging from 1 billion to over 175 billion parameters to optimize model performance for applications across industries, from healthcare to finance …