On a Neural Implementation of Brenier’s Polar Factorization

In 1991, Brenier proved a theorem that generalizes the polar decomposition for square matrices (factored as PSD $\times$ unitary) to any vector field $F:\mathbb{R}^d\rightarrow \mathbb{R}^d$. The theorem, known as the polar factorization theorem, states that any field $F$ can be recovered as the composition of the gradient of a convex function $u$ with …

Accelerate your generative AI distributed training workloads with the NVIDIA NeMo Framework on Amazon EKS

In today’s rapidly evolving landscape of artificial intelligence (AI), training large language models (LLMs) poses significant challenges. These models often require enormous computational resources and sophisticated infrastructure to handle the vast amounts of data and complex algorithms involved. Without a structured framework, the process can become prohibitively time-consuming, costly, and complex. Enterprises struggle with managing …

How Gramercy Tech used Imagen to deliver an innovative conference experience

Editor’s note: In this guest post, we will hear from Gramercy Tech about their experience working with Google Cloud as both a customer and vendor. Organizing engaging events can be quite challenging, but by utilizing Google’s Imagen throughout the conference, the Gramercy team was able to demonstrate the possibilities of generative AI for creating real-time …

Contrasting Multiple Representations with the Multi-Marginal Matching Gap

Learning meaningful representations of complex objects that can be seen through multiple ($k\geq 3$) views or modalities is a core task in machine learning. Existing methods use losses originally intended for paired views, and extend them to $k$ views, either by instantiating $\tfrac12 k(k-1)$ loss-pairs, or by using reduced embeddings, following a one vs. average-of-rest strategy. …

Video auto-dubbing using Amazon Translate, Amazon Bedrock, and Amazon Polly

This post is co-written with MagellanTV and Mission Cloud.  Video dubbing, or content localization, is the process of replacing the original spoken language in a video with another language while synchronizing audio and video. Video dubbing has emerged as a key tool in breaking down linguistic barriers, enhancing viewer engagement, and expanding market reach. However, …

Careful With That Scalpel: Improving Gradient Surgery With an EMA

Beyond minimizing a single training loss, many deep learning estimation pipelines rely on an auxiliary objective to quantify and encourage desirable properties of the model (e.g. performance on another dataset, robustness, agreement with a prior). Although the simplest approach to incorporating an auxiliary loss is to sum it with the training loss as a regularizer, …

Mile-High AI: NVIDIA Research to Present Advancements in Simulation and Gen AI at SIGGRAPH

NVIDIA is taking an array of advancements in rendering, simulation and generative AI to SIGGRAPH 2024, the premier computer graphics conference, which will take place July 28 – Aug. 1 in Denver. More than 20 papers from NVIDIA Research introduce innovations advancing synthetic data generators and inverse rendering tools that can help train next-generation models. …

Is Your Chatbot Sabotaging Your Business? The Tell-Tale Signs and a Rescue Plan

It’s no exaggeration to say that chatbots are a mainstream technology. A whopping 88% of web users chatted with bots in the previous year. What’s more, 7 out of 10 find the experience positive. However, despite these promising statistics, there’s still a significant portion of users who have less-than-stellar experiences with conversational systems. The Dark …

Palantir’s Response to OMB on Responsible Procurement of AI in Government

Editor’s Note: This blog post highlights Palantir’s response to a Request for Information pursuant to the 2023 Executive Order on Safe, Secure, and Trustworthy AI. For more information about Palantir’s contributions to AI Policy, visit our website here. Introduction: At Palantir, we are proud to provide the institutions serving our societies with the vital software platforms …

Using Agents for Amazon Bedrock to interactively generate infrastructure as code

In the diverse toolkit available for deploying cloud infrastructure, Agents for Amazon Bedrock offers a practical and innovative option for teams looking to enhance their infrastructure as code (IaC) processes. Agents for Amazon Bedrock automates the prompt engineering and orchestration of user-requested tasks. After being configured, an agent builds the prompt and augments it with …