Categories: Image

Stable Diffusion v2-1-unCLIP model released

Information taken from the GitHub page: https://github.com/Stability-AI/stablediffusion/blob/main/doc/UNCLIP.MD

HuggingFace checkpoints and diffusers integration: https://huggingface.co/stabilityai/stable-diffusion-2-1-unclip

Public web-demo: https://clipdrop.co/stable-diffusion-reimagine

unCLIP is the approach behind OpenAI’s DALL·E 2, trained to invert CLIP image embeddings. We finetuned SD 2.1 to accept a CLIP ViT-L/14 image embedding in addition to the text encodings. This means that the model can be used to produce image variations, but can also be combined with a text-to-image embedding prior to yield a full text-to-image model at 768×768 resolution.

If you would like to try a demo of this model on the web, please visit https://clipdrop.co/stable-diffusion-reimagine

This model essentially uses an input image as the ‘prompt’ rather than require a text prompt. It does this by first converting the input image into a ‘CLIP embedding’, and then feeds this into a stable diffusion 2.1-768 model fine-tuned to produce an image from such CLIP embeddings, enabling a users to generate multiple variations of a single image this way. Note that this is distinct from how img2img does it (the structure of the original image is generally not kept).

Blog post: https://stability.ai/blog/stable-diffusion-reimagine

submitted by /u/hardmaru
[link] [comments]

More Prompting Techniques for Stable Diffusion

The image diffusion model, in its simplest form, generates an image from the prompt. The prompt can be a text prompt or an image as long as a suitable encoder is available to convert it into a tensor that the model can use as a condition to guide the generation…

May 6, 2024

In "AI/ML Research"

Stable Diffusion Reimagine

Stability AI is excited to announce the launch of Stable Diffusion Reimagine! We invite users to experiment with images and ‘reimagine’ their designs through Stable Diffusion.Stable Diffusion Reimagine is a new Clipdrop tool that allows users to generate multiple variations of a single image without limits. No need for complex…

March 18, 2023

In "Image"

GPT-4 + Stable-Diffusion = ?: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models

May 26, 2023

In "AI/ML Research"

AI Generated Robotic Content

Next 5 ways OpenAI’s ChatGPT plugins could change the AI game | The AI Beat »

Previous « Uncovering new opportunities with edge AI

Published by

AI Generated Robotic Content

Tags: ai images

3 years ago

Brad Pitt casts Elliot for Achilles – an Ai acting performance experiment

I am putting most of my efforts to achieve more realistic Ai acting with natural…

3 hours ago

AI/ML News

New light-based switch could cut chip energy use and speed future AI photonics

Photonic devices are hardware systems that can process information using light instead of electricity. These…

4 hours ago

Image

Microsoft Lens First Tests: It’s Pretty Decent! – ComfyUI Native Support About to Be Merged

Model weights: https://huggingface.co/Comfy-Org/Lens PR: https://github.com/Comfy-Org/ComfyUI/pull/14077 You'll need to git the merge pull request if you're…

1 day ago