Categories: Image

An experiment with “realism” with Wan2.2 that are safe for work images

Got bored seeing the usual women pics every time I opened this sub so decided to make something a little friendlier for the work place. I was loosely working to a theme of “Scandinavian Fishing Town” and wanted to see how far I could get making them feel “realistic”. Yes I am aware there’s all sorts of jank going on, especially in the backgrounds. So when I say “realistic” I don’t mean “flawless”, just that when your eyes first fall on the image it feels pretty real. Some are better than others.

Key points:

  • Used fp8 for high noise and fp16 for low noise on a 4090, which just about filled vram and ram to the max. Wanted to do purely fp16 but memory was having none of it.
  • Had to separate out the SeedVR2 part of the workflow because Comfy wasn’t releasing the ram, so would just OOM on me on every workflow (64gb ram). Having to manually clear the ram after generating the image and before seedVR2. Yes I tried every “Clear Ram” node I could find and none of them worked. Comfy just hordes the ram until it crashes.
  • I found using res_2m/bong_tangent in the high noise stage would create horrible contrasty images, which is why I went with Euler for the high noise part.
  • It uses a lower step count in the high noise. I didn’t really see much benefit increasing the steps there.

If you see any problems in this setup or have suggestions how I should improve it, please fire away. Especially the low noise. I feel like I’m missing something important there.

Included image of the workflow. Images should have it but I think uploading them here will lose it?

submitted by /u/kemb0
[link] [comments]

AI Generated Robotic Content

Share
Published by
AI Generated Robotic Content
Tags: ai images

Recent Posts

Intel announced new enterprise GPU with 32GB vram

If only it works well with work flow. Nvidia have CUDA, AMD have ROCM, I…

5 hours ago

5 Practical Techniques to Detect and Mitigate LLM Hallucinations Beyond Prompt Engineering

My friend who is a developer once asked an LLM to generate documentation for a…

5 hours ago

Exclusive Self Attention

We introduce exclusive self attention (XSA), a simple modification of self attention (SA) that improves…

5 hours ago

Unlocking video insights at scale with Amazon Bedrock multimodal models

Video content is now everywhere, from security surveillance and media production to social platforms and…

5 hours ago

DRA: A new era of Kubernetes device management with Dynamic Resource Allocation

The explosion of large language models (LLMs) has increased demand for high-performance accelerators like GPUs…

5 hours ago

Amazon Spring Sale Deal: The Typhur Dome 2 Air Fryer Is 30% Off

I tested more than 30 air fryers this past year. The Typhur Dome 2 is…

6 hours ago