Categories: FAANG

Reducing Bias and Improving Safety in DALL·E 2

Today, we are implementing a new technique so that DALL·E generates images of people that more accurately reflect the diversity of the world’s population. This technique is applied at the system level when DALL·E is given a prompt describing a person that does not specify race or gender, like “firefighter.”

Based on our internal evaluation, users were 12× more likely to say that DALL·E images included people of diverse backgrounds after the technique was applied. We plan to improve this technique over time as we gather more data and feedback.


A photo of a CEO

In April, we started previewing the DALL·E 2 research to a limited number of people, which has allowed us to better understand the system’s capabilities and limitations and improve our safety systems.

During this preview phase, early users have flagged sensitive and biased images which have helped inform and evaluate this new mitigation.

We are continuing to research how AI systems, like DALL·E, might reflect biases in its training data and different ways we can address them.

During the research preview we have taken other steps to improve our safety systems, including:

  • Minimizing the risk of DALL·E being misused to create deceptive content by rejecting image uploads containing realistic faces and attempts to create the likeness of public figures, including celebrities and prominent political figures.
  • Making our content filters more accurate so that they are more effective at blocking prompts and image uploads that violate our content policy while still allowing creative expression.
  • Refining automated and human monitoring systems to guard against misuse.

These improvements have helped us gain confidence in the ability to invite more users to experience DALL·E.

Expanding access is an important part of our deploying AI systems responsibly because it allows us to learn more about real-world use and continue to iterate on our safety systems.

var loadImages = async function(oldId, newId) { return new Promise(function(resolve, reject) { var newUrls = images.map(function(img) { return img.src.replace(oldId, newId); }); var remaining = newUrls.length; var newImages = [];

for (var i = 0; i < newUrls.length; i++) { var img = new Image(); img.onload = function() { --remaining; newImages.push(img); if (remaining <= 0) { resolve(newImages); } }; img.src = newUrls[i]; } }); } var handleToggler = async function(e) { var target = e.currentTarget; var oldActiveToggler = document.querySelector('.js-toggler.active'); // Update toggle active state oldActiveToggler.classList.remove('active'); oldActiveToggler.setAttribute('aria-selected', 'false'); target.classList.add('active'); target.setAttribute('aria-selected', 'true'); // Update prompt input prompt.innerText = target.dataset.jsPrompt; // Enable loading animation const animationTimeout = setTimeout(function() { gallery.classList.add('loading'); }, 250); // Load new images in the background await loadImages(oldActiveToggler.dataset.jsId, target.dataset.jsId); clearTimeout(animationTimeout); // Remove loading animation and update images images.forEach(function(image, index) { image.setAttribute('src', image.src.replace(oldActiveToggler.dataset.jsId, target.dataset.jsId)); }); gallery.classList.remove('loading'); }; document.addEventListener('DOMContentLoaded', function() { [...document.querySelectorAll('.js-toggler')].forEach(function(toggler) { toggler.addEventListener('click', handleToggler); toggler.addEventListener('mouseenter', function(e) { var target = e.currentTarget; var oldActiveToggler = document.querySelector('.js-toggler.active'); loadImages(oldActiveToggler.dataset.jsId, target.dataset.jsId); }); }); });

AI Generated Robotic Content

Recent Posts

3 Nuclear Startups Hit a Big Milestone. Why It Matters—and Why It Doesn’t

The companies’ Fourth of July plans include celebrating new reactor designs coming online. But there’s…

20 hours ago

Context vs. Memory Engineering in Agentic AI Systems

Compression on Arrival Tool outputs should be compressed after a call returns, not after the…

2 days ago

Why I disappeared for 3 Months & What’s Next

I’ve been quiet since November because I’ve been building.Over the past few months, AI has…

2 days ago

Multi-Agent Teams Hold Experts Back

Multi-agent LLM systems are increasingly deployed as autonomous collaborators, where agents interact freely rather than…

2 days ago

Managing Elasticsearch Reindex at Scale: Performance, Reliability, and Observability

Editor’s Note: This is the fourth post in a series exploring how Palantir customizes infrastructure…

2 days ago

GenPage: Towards End-to-End Generative Homepage Construction at Netflix

Authors: Lequn Wang, Jiangwei Pan, and Linas BaltrunasFigure 1. Autoregressive homepage generation. GenPage builds a…

2 days ago