AI/ML

Positional Description for Numerical Normalization

We present a Positional Description Scheme (PDS) tailored for digit sequences, integrating placeholder value information for each digit. Given the…

2 years ago

Maximize your LLM serving throughput for GPUs on GKE — a practical guide

Let’s face it: Serving AI foundation models such as large language models (LLMs) can be expensive. Between the need for…

2 years ago

NVIDIA to Present Innovations at Hot Chips That Boost Data Center Performance and Energy Efficiency

A deep technology conference for processor and system architects from industry and academia has become a key forum for the…

2 years ago

Types of Chatbots: An Overview for Business People

In the effort to automate as many processes as possible, companies resort to various solutions powered by modern technology. We’ve…

2 years ago

On the Benefits of Pixel-Based Hierarchical Policies for Task Generalization

Reinforcement learning practitioners often avoid hierarchical policies, especially in image-based observation spaces. Typically, the single-task performance improvement over flat-policy counterparts…

2 years ago

Build private and secure enterprise generative AI applications with Amazon Q Business using IAM Federation

Amazon Q Business is a conversational assistant powered by generative artificial intelligence (AI) that enhances workforce productivity by answering questions…

2 years ago

Announcing the Jamba 1.5 Model Family from AI21 Labs on Vertex AI

Today, we’re announcing the launch of the Jamba 1.5 Model Family — AI21 Labs’ new family of open models —…

2 years ago

Enhance call center efficiency using batch inference for transcript summarization with Amazon Bedrock

Today, we are excited to announce general availability of batch inference for Amazon Bedrock. This new feature enables organizations to…

2 years ago

Run your AI inference applications on Cloud Run with NVIDIA GPUs

Developers love Cloud Run for its simplicity, fast autoscaling, scale-to-zero capabilities, and pay-per-use pricing. Those same benefits come into play…

2 years ago

Lightweight Champ: NVIDIA Releases Small Language Model With State-of-the-Art Accuracy

Developers of generative AI typically face a tradeoff between model size and accuracy. But a new language model released by…

2 years ago