ProText: A Benchmark Dataset for Measuring (Mis)gendering in Long-Form Texts
We introduce ProText, a dataset for measuring gendering and misgendering in stylistically diverse long-form English texts. ProText spans three dimensions: Theme nouns (names, occupations, titles, kinship terms), Theme category (stereotypically male, stereotypically female, gender-neutral/non-gendered), and Pronoun category (masculine, feminine, gender-neutral, none). The dataset is designed to probe (mis)gendering in text transformations such as summarization and …
Read more “ProText: A Benchmark Dataset for Measuring (Mis)gendering in Long-Form Texts”