Categories: Image

Qwen-Image-Edit LoRA training is here + we just dropped our first trained model

Hey everyone! 👋

We just shipped something we’ve been cooking up for a while – full LoRA training support for Qwen-Image-Edit, plus our first trained model is now live on Hugging Face!

What’s new:
✅ Complete training pipeline for Qwen-Image-Edit LoRA adapters
✅ Open-source trainer with easy YAML configs
✅ First trained model: InScene LoRA specializing in spatial understanding

Why this matters:
Control-based image editing has been getting hot, but training custom LoRA adapters was a pain. Now you can fine-tune Qwen-Image-Edit for your specific use cases with our trainer!
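
For anyone new to LoRA, here is a rough sketch of why adapters are cheap to train. It is plain Python for illustration only, not code from our trainer, and the layer size and rank below are made-up numbers, not Qwen-Image-Edit's actual dimensions. The idea: the base weight `W` stays frozen and only two small matrices `A` and `B` are trained, with the output computed as `W x + (alpha / rank) * B (A x)`.

```python
from typing import List, Tuple

Matrix = List[List[float]]
Vector = List[float]


def matvec(M: Matrix, v: Vector) -> Vector:
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def lora_forward(x: Vector, W: Matrix, A: Matrix, B: Matrix,
                 alpha: float, rank: int) -> Vector:
    """Frozen base layer plus a scaled low-rank update.

    W: d_out x d_in (frozen), A: rank x d_in, B: d_out x rank (trained).
    """
    base = matvec(W, x)
    update = matvec(B, matvec(A, x))
    scale = alpha / rank
    return [b + scale * u for b, u in zip(base, update)]


def lora_param_counts(d_in: int, d_out: int, rank: int) -> Tuple[int, int]:
    """Return (parameters in the full layer, parameters in the adapter)."""
    return d_in * d_out, rank * (d_in + d_out)


if __name__ == "__main__":
    # Illustrative sizes only (assumption, not the real model's shapes).
    full, lora = lora_param_counts(d_in=4096, d_out=4096, rank=16)
    print(f"full layer: {full} params, adapter: {lora} params")
    print(f"adapter is {full // lora}x smaller")  # 128x for these sizes
```

With `B` initialized to zeros, as is common for LoRA, the adapter starts out as a no-op and the model behaves exactly like the base model until training moves `B` away from zero.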

What makes InScene LoRA special:

  • 🎯 Enhanced scene coherence during edits
  • 🎬 Better camera perspective handling
  • 🎭 Improved action sequences within scenes
  • 🧠 Smarter spatial understanding

Below are a few examples (left: the original model, right: with the LoRA):

  1. Prompt: Make a shot in the same scene of the left hand securing the edge of the cutting board while the right hand tilts it, causing the chopped tomatoes to slide off into the pan, camera angle shifts slightly to the left to center more on the pan.

https://preview.redd.it/o1j3d1epa8kf1.jpg?width=1462&format=pjpg&auto=webp&s=4139c5b7216266dac38cbff033555713bd31402a

  2. Prompt: Make a shot in the same scene of the chocolate sauce flowing downward from above onto the pancakes, slowly zoom in to capture the sauce spreading out and covering the top pancake, then pan slightly down to show it cascading down the sides.

https://preview.redd.it/zxw3n9f6b8kf1.jpg?width=1672&format=pjpg&auto=webp&s=c229851afbc934538fccc783be7dfa36bd07daac

  3. On the left is the original image, and on the right are the generation results with LoRA, showing the consistency of the shoes and leggings.

Prompt: Make a shot in the same scene of the person moving further away from the camera, keeping the camera steady to maintain focus on the central subject, gradually zooming out to capture more of the surrounding environment as the figure becomes less detailed in the distance.

https://preview.redd.it/2rf7p69ub8kf1.jpg?width=1672&format=pjpg&auto=webp&s=ea5fe2e7bea74645fb7d18c22895a438b958652f
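
All three prompts above open with the same phrase, "Make a shot in the same scene of", followed by the action and then any camera direction. If you want to script your own prompts in that shape, something like this works as a starting point. The helper name and its arguments are hypothetical (not part of the trainer or the model), and it simply assumes the shared opening phrase is the pattern worth reusing.

```python
# Opening phrase shared by the three example prompts in this post.
PREFIX = "Make a shot in the same scene of"


def build_inscene_prompt(action: str, camera: str = "") -> str:
    """Build a prompt shaped like the examples: prefix, action, camera note.

    Hypothetical helper for illustration; trailing punctuation on the
    inputs is trimmed so the result ends with exactly one period.
    """
    action = action.strip().rstrip(".,")
    prompt = f"{PREFIX} {action}"
    camera = camera.strip().rstrip(".,")
    if camera:
        prompt += f", {camera}"
    return prompt + "."


if __name__ == "__main__":
    print(build_inscene_prompt(
        "the person moving further away from the camera",
        "keeping the camera steady to maintain focus on the central subject",
    ))
```

Leaving `camera` empty gives a prompt that only describes the action.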

Links:

P.S. – This is just our first LoRA for Qwen-Image-Edit. We’re planning to add more specialized LoRAs for different editing scenarios. What would you like to see next?

submitted by /u/Worldly-Ant-6889

Published by
AI Generated Robotic Content
Tags: ai images
