Categories: Image

Qwen-Image-Edit LoRA training is here + we just dropped our first trained model

Hey everyone! 👋

We just shipped something we’ve been cooking up for a while – full LoRA training support for Qwen-Image-Edit, plus our first trained model is now live on Hugging Face!

What’s new:
✅ Complete training pipeline for Qwen-Image-Edit LoRA adapters
✅ Open-source trainer with easy YAML configs
✅ First trained model: InScene LoRA, specializing in spatial understanding

Why this matters:
Control-based image editing has been getting hot, but training custom LoRA adapters was a pain. Now you can fine-tune Qwen-Image-Edit for your specific use cases with our trainer!

What makes InScene LoRA special:

  • 🎯 Enhanced scene coherence during edits
  • 🎬 Better camera perspective handling
  • 🎭 Improved action sequences within scenes
  • 🧠 Smarter spatial understanding

Below are a few examples (the left shows the original model, the right shows the LoRA):

  1. Prompt: Make a shot in the same scene of the left hand securing the edge of the cutting board while the right hand tilts it, causing the chopped tomatoes to slide off into the pan, camera angle shifts slightly to the left to center more on the pan.

https://preview.redd.it/o1j3d1epa8kf1.jpg?width=1462&format=pjpg&auto=webp&s=4139c5b7216266dac38cbff033555713bd31402a

  2. Prompt: Make a shot in the same scene of the chocolate sauce flowing downward from above onto the pancakes, slowly zoom in to capture the sauce spreading out and covering the top pancake, then pan slightly down to show it cascading down the sides.

https://preview.redd.it/zxw3n9f6b8kf1.jpg?width=1672&format=pjpg&auto=webp&s=c229851afbc934538fccc783be7dfa36bd07daac

  3. On the left is the original image, and on the right is the result generated with the LoRA, showing the consistency of the shoes and leggings.

Prompt: Make a shot in the same scene of the person moving further away from the camera, keeping the camera steady to maintain focus on the central subject, gradually zooming out to capture more of the surrounding environment as the figure becomes less detailed in the distance.

https://preview.redd.it/2rf7p69ub8kf1.jpg?width=1672&format=pjpg&auto=webp&s=ea5fe2e7bea74645fb7d18c22895a438b958652f
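All three example prompts open with the same phrase, "Make a shot in the same scene of", followed by the action and then an optional camera instruction. Here is a minimal sketch of a helper for writing prompts in that shape. The function and constant names are made up for illustration, and treating the shared opening as a fixed prefix is an assumption based only on the examples above, not a documented requirement of the LoRA.

```python
# Shared opening of the three example prompts above.
# Assumption: reusing it verbatim is a sensible default, not a confirmed trigger phrase.
SCENE_PREFIX = "Make a shot in the same scene of"


def build_inscene_prompt(action: str, camera: str = "") -> str:
    """Join the shared prefix, an action description, and an optional camera note."""
    # Drop trailing punctuation so the pieces join cleanly with commas.
    parts = [f"{SCENE_PREFIX} {action.strip().rstrip('.,')}"]
    if camera.strip():
        parts.append(camera.strip().rstrip(".,"))
    return ", ".join(parts) + "."


if __name__ == "__main__":
    print(build_inscene_prompt(
        "the chocolate sauce flowing downward from above onto the pancakes",
        "slowly zoom in to capture the sauce spreading out",
    ))
```

Keeping the action and the camera move as separate arguments makes it easy to vary one while holding the other fixed when comparing outputs.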

Links:

P.S. – This is just our first LoRA for Qwen-Image-Edit. We’re planning to add more specialized LoRAs for different editing scenarios. What would you like to see next?

submitted by /u/Worldly-Ant-6889

AI Generated Robotic Content

Published by
AI Generated Robotic Content
Tags: ai images
