1 mVzr1Mr.max 1000x1000 1

Supervised Fine Tuning for Gemini: A best practices guide

Foundation models such as Gemini have revolutionized how we work, but sometimes they need guidance to excel at specific business tasks. Perhaps their answers are too long, or their summaries miss the mark. That’s where supervised fine-tuning (SFT) comes in. When done right, it unlocks incredible precision to tailor Gemini for specialized tasks, domains, and …

3D Shape Tokenization

We introduce Shape Tokens, a 3D representation that is continuous, compact, and easy to integrate into machine learning models. Shape Tokens serve as conditioning vectors, representing shape information within a 3D flow-matching model. This flow-matching model is trained to approximate probability density functions corresponding to delta functions concentrated on the surfaces of 3D shapes. By …

Title Launch Observability at Netflix Scale

Part 2: Navigating Ambiguity By: Varun Khaitan With special thanks to my stunning colleagues: Mallika Rao, Esmir Mesic, Hugo Marques Building on the foundation laid in Part 1, where we explored the “what” behind the challenges of title launch observability at Netflix, this post shifts focus to the “how.” How do we ensure every title launches seamlessly …

ML 15583 System Workflow

Efficiently build and tune custom log anomaly detection models with Amazon SageMaker

In this post, we walk you through the process to build an automated mechanism using Amazon SageMaker to process your log data, run training iterations over it to obtain the best-performing anomaly detection model, and register it with the Amazon SageMaker Model Registry for your customers to use it. Log-based anomaly detection involves identifying anomalous …

The PyTorch developer’s guide to JAX fundamentals

Like many PyTorch users, you may have heard great things about JAX — its high performance, the elegance of its functional programming approach, and its powerful, built-in support for parallel computation. However, you may have also struggled to find what you need to get started: a straightforward, easy-to-follow tutorial to help you understand the basics …

Part 2: A Survey of Analytics Engineering Work at Netflix

This article is the second in a multi-part series sharing a breadth of Analytics Engineering work at Netflix, recently presented as part of our annual internal Analytics Engineering conference. Need to catch up? Check out Part 1. In this article, we highlight a few exciting analytic business applications, and in our final article we’ll go …

12Au17ZKsvYcR6rwxcEWpZHFA

Telegram Chatbots: Are They a Good Fit for Your Business?

Telegram chatbots are rapidly gaining traction, with over 1.5 million bots already created. As one of the fastest-growing messaging platforms, Telegram boasts a user base exceeding 550 million globally, offering businesses an unparalleled opportunity to engage with their audience effectively. In an era where customers prefer direct communication, research from Social Media Today reveals that …

Have You Heard? 5 AI Podcast Episodes Listeners Loved in 2024

NVIDIA’s AI Podcast gives listeners the inside scoop on the ways AI is transforming nearly every industry.  Since the show’s debut in 2016, it’s garnered more than 6 million listens across 200-plus episodes, covering how generative AI is used to power applications including assistive technology for the visually impaired, wildfire alert systems and the Roblox …

ML 17334 RAG

Optimizing costs of generative AI applications on AWS

The report The economic potential of generative AI: The next productivity frontier, published by McKinsey & Company, estimates that generative AI could add an equivalent of $2.6 trillion to $4.4 trillion in value to the global economy. The largest value will be added across four areas: customer operations, marketing and sales, software engineering, and R&D. …