Categories: FAANG

Layer-Wise Data-Free CNN Compression

We present an efficient method for compressing a trained neural network without using any data. Our data-free method requires 14x-450x fewer FLOPs than comparable state-of-the-art methods. We break the problem of data-free network compression into a number of independent layer-wise compressions. We show how to efficiently generate layer-wise training data, and how to precondition the network to maintain accuracy during layer-wise compression. We show state-of-the-art performance on MobileNetV1 for data-free low-bit-width quantization. We also show state-of-the-art performance on data-free…
AI Generated Robotic Content

Recent Posts

Automated Feature Engineering in PyCaret

Automated feature engineering in

22 hours ago

Updating the Frontier Safety Framework

Our next iteration of the FSF sets out stronger security protocols on the path to…

22 hours ago

Adaptive Training Distributions with Scalable Online Bilevel Optimization

Large neural networks pretrained on web-scale corpora are central to modern machine learning. In this…

22 hours ago

Orchestrate seamless business systems integrations using Amazon Bedrock Agents

Generative AI has revolutionized technology through generating content and solving complex problems. To fully take…

22 hours ago

Helping our partners co-market faster with AI

At Google Cloud, we're deeply invested in making AI helpful to organizations everywhere — not…

22 hours ago

AMD’s Q4 revenue hits $7.66B, up 24% but stock falls

Advanced Micro Devices reported revenue of $7.658 billion for the fourth quarter, up 24% from…

23 hours ago