ProText: A Benchmark Dataset for Measuring (Mis)gendering in Long-Form Texts

We introduce ProText, a dataset for measuring gendering and misgendering in stylistically diverse long-form English texts. ProText spans three dimensions: Theme nouns (names, occupations, titles, kinship terms), Theme category (stereotypically male, stereotypically female, gender-neutral/non-gendered), and Pronoun category (masculine, feminine, gender-neutral, none). The dataset is designed to probe (mis)gendering in text transformations such as summarization and …

ML20076 image 1

Build reliable AI agents with Amazon Bedrock AgentCore Evaluations

Your AI agent worked in the demo, impressed stakeholders, handled test scenarios, and seemed ready for production. Then you deployed it, and the picture changed. Real users experienced wrong tool calls, inconsistent responses, and failure modes nobody anticipated during testing. The result is a gap between expected agent behavior and actual user experience in production. …

Chip-scale light technology could power faster AI and data center communications

Researchers at Trinity have developed a new light-based technology on a tiny chip that could help make the data centers behind cloud computing, artificial intelligence, and global internet services faster and more efficient. In the new research, recently published in Nature Communications, the Trinity team reported one such promising advance with collaborators at the University …

Mugen – Modernized Anime SDXL Base, or how to make Bluvoll tiny bit less sane

Your monthly “Anzhc’s Posts” issue have arrived. Today im introducing – Mugen – continuation of the Flux 2 VAE experiment on SDXL. We have renamed it to signify strong divergence from prior Noobai models, and to finally have a normal name, no more NoobAI-Flux2VAE-Rectified-Flow-v-0.3-oc-gaming-x. In this run in particular we have prioritized character knowledge, and …

Mugen – Modernized Anime SDXL Base, or how to make Bluvoll tiny bit less sane

Your monthly “Anzhc’s Posts” issue have arrived. Today im introducing – Mugen – continuation of the Flux 2 VAE experiment on SDXL. We have renamed it to signify strong divergence from prior Noobai models, and to finally have a normal name, no more NoobAI-Flux2VAE-Rectified-Flow-v-0.3-oc-gaming-x. In this run in particular we have prioritized character knowledge, and …