Learning Bias-reduced Word Embeddings Using Dictionary Definitions
Pre-trained word embeddings, such as GloVe, have shown undesirable gender, racial, and religious biases. To address this problem, we propose DD-GloVe, a train-time debiasing algorithm to learn word embeddings by leveraging dictionary definitions. We introduce dictionary-guided loss functions that encourage word embeddings to be similar to their relatively neutral dictionary definition representations. Existing debiasing algorithms …
Read more “Learning Bias-reduced Word Embeddings Using Dictionary Definitions”