Categories: Image

Why are we still training LoRA and not moved to DoRA as a standard?

Just wondering, this has been a head-scratcher for me for a while.

Everywhere I look claims DoRA is superior to LoRA in what seems like all aspects. It doesn’t require more power or resources to train.

I googled DoRA training for newer models – Wan, Qwen, etc. Didn’t find anything, except a reddit post from a year ago asking pretty much exactly what I’m asking here today lol. And every comment seems to agree DoRA is superior. And Comfy has supported DoRA now for a long time.

Yet, here we are – still training LoRAs when there’s been a better option for years? This community is always fairly quick to adopt the latest and greatest. It’s odd this slipped through? I use diffusion-pipe to train pretty much everything now. I’m curious to know if theres a way I could train DoRAs with that. Or if there is a different method out there right now that is capable of training a wan DoRA.

Thanks for any insight, and curious to hear others opinions on this.

Edit: very insightful and interesting responses, my opinion has definitely shifted. @roger_ducky has a great explanation of DoRA drawbacks I was unaware of. Also cool to hear from people who had worse results than LoRA training using the same dataset/params. It sounds like sometimes LoRA is better, and sometimes DoRA is better, but DoRA is certainly not better in every instance – as I was initially led to believe. But still feels like DoRAs deserve more exploration and testing than they’ve had, especially with newer models.

submitted by /u/Realistic_Rabbit5429
[link] [comments]

AI Generated Robotic Content

Share
Published by
AI Generated Robotic Content
Tags: ai images

Recent Posts

3 Nuclear Startups Hit a Big Milestone. Why It Matters—and Why It Doesn’t

The companies’ Fourth of July plans include celebrating new reactor designs coming online. But there’s…

19 hours ago

Context vs. Memory Engineering in Agentic AI Systems

Compression on Arrival Tool outputs should be compressed after a call returns, not after the…

2 days ago

Why I disappeared for 3 Months & What’s Next

I’ve been quiet since November because I’ve been building.Over the past few months, AI has…

2 days ago

Multi-Agent Teams Hold Experts Back

Multi-agent LLM systems are increasingly deployed as autonomous collaborators, where agents interact freely rather than…

2 days ago

Managing Elasticsearch Reindex at Scale: Performance, Reliability, and Observability

Editor’s Note: This is the fourth post in a series exploring how Palantir customizes infrastructure…

2 days ago

GenPage: Towards End-to-End Generative Homepage Construction at Netflix

Authors: Lequn Wang, Jiangwei Pan, and Linas BaltrunasFigure 1. Autoregressive homepage generation. GenPage builds a…

2 days ago