Combining Compressions for Multiplicative Size Scaling on Natural Language Tasks

2 years ago

Quantization, knowledge distillation, and magnitude pruning are among the most popular methods for neural network compression in NLP. Independently, these…

Integrating Categorical Features in End-To-End ASR

2 years ago

All-neural, end-to-end ASR systems gained rapid interest from the speech recognition community. Such systems convert speech input to text units…

Space-Efficient Representation of Entity-centric Query Language Models

2 years ago

Virtual assistants make use of automatic speech recognition (ASR) to help users answer entity-centric queries. However, spoken entity recognition is…

CVNets: High Performance Library for Computer Vision

2 years ago

We introduce CVNets, a high-performance open-source library for training deep neural networks for visual recognition tasks, including classification, detection, and…

FORML: Learning to Reweight Data for Fairness

2 years ago

Machine learning models are trained to minimize the mean loss for a single metric, and thus typically do not consider…

Regularized Training of Nearest Neighbor Language Models

2 years ago

Including memory banks in a natural language processing architecture increases model capacity by equipping it with additional data at inference…

A Dense Material Segmentation Dataset for Indoor and Outdoor Scene Parsing

2 years ago

A key algorithm for understanding the world is material segmentation, which assigns a label (metal, glass, etc.) to each pixel.…

Benign, Tempered, or Catastrophic: A Taxonomy of Overfitting

2 years ago

The practical success of overparameterized neural networks has motivated the recent scientific study of interpolating methods, which perfectly fit their…

Understanding Convolutions on Graphs

2 years ago

Understanding the building blocks and design choices of graph neural networks.

A Gentle Introduction to Graph Neural Networks

2 years ago

What components are needed for building learning algorithms that leverage the structure and properties of graphs?