Categories: FAANG

Careful With That Scalpel: Improving Gradient Surgery With an EMA

Beyond minimizing a single training loss, many deep learning estimation pipelines rely on an auxiliary objective to quantify and encourage desirable properties of the model (e.g. performance on another dataset, robustness, agreement with a prior). Although the simplest approach to incorporating an auxiliary loss is to sum it with the training loss as a regularizer, recent works have shown that one can improve performance by blending the gradients beyond a simple sum; this is known as gradient surgery. We cast the problem as a constrained minimization problem where the auxiliary objective is…
AI Generated Robotic Content

Recent Posts

Anthropic Confidentially Files for What Could Be the Largest IPO Ever

The AI giant behind Claude submitted paperwork on Monday that would take it public, just…

25 mins ago

New 3D gaze forecasting could help AR devices render scenes before users look

Augmented reality (AR) devices like smart glasses may soon be able to predict where a…

25 mins ago

Does anyone else can’t stand ComfyUI and prefers classic Automatic/Forge UI or it’s just me?

EDIT: I can't believe how many great and useful replies I've got, and not a…

23 hours ago

Serving Multiple Users at Once: How Continuous Batching Keeps LLM Inference Efficient

This article is divided into four parts; they are: • The Problem with Static Batching…

23 hours ago

Everyone Has Their Targets Set on the MacBook Neo

Dell, Microsoft, and others are unveiling new laptops to compete directly with the Neo, but…

1 day ago

Photon-driven synapse advances low-power neuromorphic systems

Modern artificial intelligence systems rely on moving large amounts of data between memory and processors,…

1 day ago