Categories: Image

Qwen-Image-Edit LoRA training is here + we just dropped our first trained model

Hey everyone! đź‘‹

We just shipped something we’ve been cooking up for a while – full LoRA training support for Qwen-Image-Edit, plus our first trained model is now live on Hugging Face!
What’s new:
âś… Complete training pipeline for Qwen-Image-Edit LoRA adapters
âś… Open-source trainer with easy YAML configs
âś… First trained model: Inscene LoRA specializing in spatial understanding

Why this matters:
Control-based image editing has been getting hot, but training custom LoRA adapters was a pain. Now you can fine-tune Qwen-Image-Edit for your specific use cases with our trainer!

What makes InScene LoRA special:

  • 🎯 Enhanced scene coherence during edits
  • 🎬 Better camera perspective handling
  • 🎭 Improved action sequences within scenes
  • đź§  Smarter spatial understanding

Below are a few examples (the left shows the original model, the right shows the LoRA)

  1. Prompt: Make a shot in the same scene of the left hand securing the edge of the cutting board while the right hand tilts it, causing the chopped tomatoes to slide off into the pan, camera angle shifts slightly to the left to center more on the pan.

https://preview.redd.it/o1j3d1epa8kf1.jpg?width=1462&format=pjpg&auto=webp&s=4139c5b7216266dac38cbff033555713bd31402a

  1. Prompt: Make a shot in the same scene of the chocolate sauce flowing downward from above onto the pancakes, slowly zoom in to capture the sauce spreading out and covering the top pancake, then pan slightly down to show it cascading down the sides.

https://preview.redd.it/zxw3n9f6b8kf1.jpg?width=1672&format=pjpg&auto=webp&s=c229851afbc934538fccc783be7dfa36bd07daac

  1. On the left is the original image, and on the right are the generation results with LoRA, showing the consistency of the shoes and leggings.

Prompt: Make a shot in the same scene of the person moving further away from the camera, keeping the camera steady to maintain focus on the central subject, gradually zooming out to capture more of the surrounding environment as the figure becomes less detailed in the distance.

https://preview.redd.it/2rf7p69ub8kf1.jpg?width=1672&format=pjpg&auto=webp&s=ea5fe2e7bea74645fb7d18c22895a438b958652f

Links:

P.S. – This is just our first LoRA for Qwen Image Edit. We’re planning add more specialized LoRAs for different editing scenarios. What would you like to see next?

submitted by /u/Worldly-Ant-6889
[link] [comments]

AI Generated Robotic Content

Share
Published by
AI Generated Robotic Content
Tags: ai images

Recent Posts

We may have a new SOTA open-source model: ERNIE-Image Comparisons

Base model is definitely SOTA, can even easily compete with closed-source ones in terms of…

1 hour ago

Navigating the generative AI journey: The Path-to-Value framework from AWS

Generative AI is reshaping how organizations approach productivity, customer experiences, and operational capabilities. Across industries,…

1 hour ago

The Surprising MacBook Neo Competitor You’ve Never Heard Of

In many ways, the HP OmniBook 5 is a better budget laptop than the MacBook…

2 hours ago

Tiny cameras in earbuds let users talk with AI about what they see

University of Washington researchers developed the first system that incorporates tiny cameras in off-the-shelf wireless…

2 hours ago

Update: Distilled v1.1 is live

We've pushed an LTX-2.3 update today. The Distilled model has been retrained (now v1.1) with…

1 day ago

How to Implement Tool Calling with Gemma 4 and Python

The open-weights model ecosystem shifted recently with the release of the

1 day ago