Categories: Image

WaTale: A free, fully local visual novel engine (Powered by SD 1.5, LayerDiffuse, and ControlNet)

Hey all. I’ve been working on WaTale, a visual novel app powered by local AI. It combines text, image, and voice models to create fully interactive, branching visual novels entirely on your own hardware.

This is a free to use, hassle-free, fully bundled solution. When relying on the local generation pipeline (Ollama for text, Stable Diffusion 1.5 for images using LayerDiffuse and ControlNet, and Kokoro ONNX for TTS), your stories and character data remain completely private. (There is also optional support for Ollama Cloud/Anthropic/OpenAI APIs if you prefer cloud text models).

The engine handles real-time generation and playback. It renders SD-generated scene backgrounds with depth parallax, full-body transparent character sprites with idle animations, and real-time lip-syncing via face inpainting. You can create custom characters, put yourself in the story, play through generated narratives with integrated minigames, export your stories, or let your characters interact autonomously.

Keep in mind this is an early preview requiring an NVIDIA GPU with at least 4GB of VRAM; you might encounter some bugs and things may break.

Looking for feedback of all types, especially on the Stable Diffusion implementation. You can see demo footage and download the application directly at watale – com. Let me know what you think or if you have any questions about how it works under the hood.

submitted by /u/Churrucaman
[link] [comments]

AI Generated Robotic Content

Share
Published by
AI Generated Robotic Content
Tags: ai images

Recent Posts

Best Apps for Focus (2026): Focus Friend, Forest, Focus Traveller

Distractions? What distractions? Here are our recommendations for apps that help you stay focused on…

1 hour ago

Comfy raises $30M to continue building the best creative AI tool in open

Hi r/StableDiffusion, Today we’re excited to share that Comfy has raised $30M at a $500M…

1 day ago

Learning Long-Term Motion Embeddings for Efficient Kinematics Generation

Understanding and predicting motion is a fundamental component of visual intelligence. Although modern video models…

1 day ago

Scaling Camera File Processing at Netflix

Orchestrating Media Workflows Through Strategic CollaborationAuthors: Eric Reinecke, Bhanu SrikanthIntroduction to Content Hub’s Media Production SuiteAt…

1 day ago

Building Workforce AI Agents with Visier and Amazon Quick

Employees across every function are expected to make faster, better-informed decisions, but the information that…

1 day ago

Day 2 at Google Cloud Next: A marathon developer keynote

At Google Cloud, every day is Developer Day, but none so much as day 2…

1 day ago