I wrote this paper two years ago: https://arxiv.org/abs/2106.09685
Super happy that people find it useful for diffusion models.
I had text in mind when I wrote the paper, so there are probably things we can tweak to make LoRA more suited for image generation. I want to better understand how exactly LoRA is used in diffusion models and its shortcomings.
Any thoughts?
submitted by /u/edwardjhu
[link] [comments]
In the past weeks, I've been tweaking Wan to get really good at video inpainting.…
Deep Think utilizes extended, parallel thinking and novel reinforcement learning techniques for significantly improved problem-solving.
At AWS Summit New York City 2025, Amazon Web Services (AWS) announced the preview of…
Cohere's Command A Vision can read graphs and PDFs to make enterprise research richer and…
OpenAI lost access to the Claude API this week after Anthropic claimed the company was…
A new artificial intelligence (AI) tool could make it much easier—and cheaper—for doctors and researchers…