james park

Optimize price-performance of LLM inference on NVIDIA GPUs using the Amazon SageMaker integration with NVIDIA NIM Microservices

NVIDIA NIM microservices now integrate with Amazon SageMaker, allowing you to deploy industry-leading large language models (LLMs) and optimize model performance and cost. You can deploy state-of-the-art LLMs in minutes instead of days using technologies such as NVIDIA TensorRT, NVIDIA TensorRT-LLM, and NVIDIA Triton Inference Server on NVIDIA accelerated instances hosted by SageMaker. NIM, part …

1.high level arch.max 1000x1000 1

Accelerate your generative AI journey with NVIDIA NeMo framework on GKE

Background Ever since generative AI gained prominence in the AI field, organizations ranging from startups to large enterprises have moved to harness its power by making it an integral part of their applications, solutions, and platforms. While the true potential of generative AI lies in creating new content based on learning from existing content, it …

NVIDIA Maxine Developer Platform to Transform $10 Billion Video Conferencing Industry

Video conferencing has allowed many to be productive from anywhere. Now NVIDIA is boosting the productivity of the developers of video conferencing, call center and streaming applications within the $10 billion industry by allowing them to easily integrate AI into their workflows. The new release of the Maxine AI Developer Platform transforms the creation of …

Two artificial intelligences talk to each other

Performing a new task based solely on verbal or written instructions, and then describing it to others so that they can reproduce it, is a cornerstone of human communication that still resists artificial intelligence (AI). A team has succeeded in modelling an artificial neural network capable of this cognitive prowess. After learning and performing a …