Categories: Image

Qwen-Image-Edit LoRA training is here + we just dropped our first trained model

Hey everyone! đź‘‹

We just shipped something we’ve been cooking up for a while – full LoRA training support for Qwen-Image-Edit, plus our first trained model is now live on Hugging Face!
What’s new:
âś… Complete training pipeline for Qwen-Image-Edit LoRA adapters
âś… Open-source trainer with easy YAML configs
âś… First trained model: Inscene LoRA specializing in spatial understanding

Why this matters:
Control-based image editing has been getting hot, but training custom LoRA adapters was a pain. Now you can fine-tune Qwen-Image-Edit for your specific use cases with our trainer!

What makes InScene LoRA special:

  • 🎯 Enhanced scene coherence during edits
  • 🎬 Better camera perspective handling
  • 🎭 Improved action sequences within scenes
  • đź§  Smarter spatial understanding

Below are a few examples (the left shows the original model, the right shows the LoRA)

  1. Prompt: Make a shot in the same scene of the left hand securing the edge of the cutting board while the right hand tilts it, causing the chopped tomatoes to slide off into the pan, camera angle shifts slightly to the left to center more on the pan.

https://preview.redd.it/o1j3d1epa8kf1.jpg?width=1462&format=pjpg&auto=webp&s=4139c5b7216266dac38cbff033555713bd31402a

  1. Prompt: Make a shot in the same scene of the chocolate sauce flowing downward from above onto the pancakes, slowly zoom in to capture the sauce spreading out and covering the top pancake, then pan slightly down to show it cascading down the sides.

https://preview.redd.it/zxw3n9f6b8kf1.jpg?width=1672&format=pjpg&auto=webp&s=c229851afbc934538fccc783be7dfa36bd07daac

  1. On the left is the original image, and on the right are the generation results with LoRA, showing the consistency of the shoes and leggings.

Prompt: Make a shot in the same scene of the person moving further away from the camera, keeping the camera steady to maintain focus on the central subject, gradually zooming out to capture more of the surrounding environment as the figure becomes less detailed in the distance.

https://preview.redd.it/2rf7p69ub8kf1.jpg?width=1672&format=pjpg&auto=webp&s=ea5fe2e7bea74645fb7d18c22895a438b958652f

Links:

P.S. – This is just our first LoRA for Qwen Image Edit. We’re planning add more specialized LoRAs for different editing scenarios. What would you like to see next?

submitted by /u/Worldly-Ant-6889
[link] [comments]

AI Generated Robotic Content

Share
Published by
AI Generated Robotic Content
Tags: ai images

Recent Posts

Remember when hands and eyes used to be a problem? (Workflow included)

Disclaimer: This is my second time posting this. My previous attempt had its video quality…

20 hours ago

TASER: Translation Assessment via Systematic Evaluation and Reasoning

We introduce TASER (Translation Assessment via Systematic Evaluation and Reasoning), a metric that uses Large…

20 hours ago

Inside the AIPCon 8 Demos Redefining the Future of Enterprise AI

Editor’s Note: AIPCon 8, Palantir’s most recent customer conference, featured breakthrough customer implementations that demonstrate…

20 hours ago

Enhance agentic workflows with enterprise search using Kore.ai and Amazon Q Business

This post was written with Meghana Chintalapudi and Surabhi Sankhla of Kore.ai. As organizations struggle…

20 hours ago

Building on the bananas momentum of generative media models on Google Cloud

It’s been exciting to see the capabilities of Nano Banana, our latest image editing model…

20 hours ago

Engineers gain new tool to design complex systems with built-in uncertainty

Designing a complex electronic device like a delivery drone involves juggling many choices, such as…

21 hours ago