Distill Hiatus

2 years ago

After five years, Distill will be taking a break.

Adversarial Reprogramming of Neural Cellular Automata

2 years ago

Reprogramming Neural CA to exhibit novel behaviour, using adversarial attacks.

Weight Banding

2 years ago

Weights in the final layer of common visual models appear as horizontal bands. We investigate how and why.

Last call: Stefan Krawcyzk’s ‘Mastering MLOps’ Live Cohort

2 years ago

Tweet Tweet Share Share Last Updated on August 19, 2022 Sponsored Post   This is your last chance to sign…

Why Initialize a Neural Network with Random Weights?

2 years ago

Tweet Tweet Share Share Last Updated on August 15, 2022 The weights of artificial neural networks must be initialized to…

When to Use MLP, CNN, and RNN Neural Networks

2 years ago

Tweet Tweet Share Share Last Updated on August 15, 2022 What neural network is appropriate for your predictive modeling problem?…

Difference Between a Batch and an Epoch in a Neural Network

2 years ago

Tweet Tweet Share Share Last Updated on August 15, 2022 Stochastic gradient descent is a learning algorithm that has a…

Using Depthwise Separable Convolutions in Tensorflow

2 years ago

Tweet Tweet Share Share Last Updated on August 10, 2022 Looking at all of the very large convolutional neural networks…

Reverse engineering the NTK: towards first-principles architecture design

2 years ago

Deep neural networks have enabled technological wonders ranging from voice recognition to machine transition to protein engineering, but their design…

Why do Policy Gradient Methods work so well in Cooperative MARL? Evidence from Policy Representation

2 years ago

In cooperative multi-agent reinforcement learning (MARL), due to its on-policy nature, policy gradient (PG) methods are typically believed to be…