Categories: Image

Qwen-Image-Edit LoRA training is here + we just dropped our first trained model

Hey everyone! đź‘‹

We just shipped something we’ve been cooking up for a while – full LoRA training support for Qwen-Image-Edit, plus our first trained model is now live on Hugging Face!
What’s new:
âś… Complete training pipeline for Qwen-Image-Edit LoRA adapters
âś… Open-source trainer with easy YAML configs
âś… First trained model: Inscene LoRA specializing in spatial understanding

Why this matters:
Control-based image editing has been getting hot, but training custom LoRA adapters was a pain. Now you can fine-tune Qwen-Image-Edit for your specific use cases with our trainer!

What makes InScene LoRA special:

  • 🎯 Enhanced scene coherence during edits
  • 🎬 Better camera perspective handling
  • 🎭 Improved action sequences within scenes
  • đź§  Smarter spatial understanding

Below are a few examples (the left shows the original model, the right shows the LoRA)

  1. Prompt: Make a shot in the same scene of the left hand securing the edge of the cutting board while the right hand tilts it, causing the chopped tomatoes to slide off into the pan, camera angle shifts slightly to the left to center more on the pan.

https://preview.redd.it/o1j3d1epa8kf1.jpg?width=1462&format=pjpg&auto=webp&s=4139c5b7216266dac38cbff033555713bd31402a

  1. Prompt: Make a shot in the same scene of the chocolate sauce flowing downward from above onto the pancakes, slowly zoom in to capture the sauce spreading out and covering the top pancake, then pan slightly down to show it cascading down the sides.

https://preview.redd.it/zxw3n9f6b8kf1.jpg?width=1672&format=pjpg&auto=webp&s=c229851afbc934538fccc783be7dfa36bd07daac

  1. On the left is the original image, and on the right are the generation results with LoRA, showing the consistency of the shoes and leggings.

Prompt: Make a shot in the same scene of the person moving further away from the camera, keeping the camera steady to maintain focus on the central subject, gradually zooming out to capture more of the surrounding environment as the figure becomes less detailed in the distance.

https://preview.redd.it/2rf7p69ub8kf1.jpg?width=1672&format=pjpg&auto=webp&s=ea5fe2e7bea74645fb7d18c22895a438b958652f

Links:

P.S. – This is just our first LoRA for Qwen Image Edit. We’re planning add more specialized LoRAs for different editing scenarios. What would you like to see next?

submitted by /u/Worldly-Ant-6889
[link] [comments]

AI Generated Robotic Content

Share
Published by
AI Generated Robotic Content
Tags: ai images

Recent Posts

Just tried animating a Pokémon TCG card with AI – Wan 2.2 blew my mind

Hey folks, I’ve been playing around with animating PokĂ©mon cards, just for fun. Honestly I…

23 hours ago

Busted by the em dash — AI’s favorite punctuation mark, and how it’s blowing your cover

AI is brilliant at polishing and rephrasing. But like a child with glitter glue, you…

24 hours ago

Scientists Have Identified the Origin of an Extraordinarily Powerful Outer Space Radio Wave

In March 2025 the Earth was hit by a fast radio burst as energetic as…

24 hours ago

Robots can now learn to use tools—just by watching us

Despite decades of progress, most robots are still programmed for specific, repetitive tasks. They struggle…

24 hours ago

Sharing that workflow [Remake Attempt]

I took a stab at recreating that person's work but including a workflow. Workflow download…

2 days ago

SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding

We introduce SlowFast-LLaVA-1.5 (abbreviated as SF-LLaVA-1.5), a family of video large language models (LLMs) offering…

2 days ago