Just wondering, this has been a head-scratcher for me for a while.
Everywhere I look claims DoRA is superior to LoRA in what seems like all aspects. It doesn’t require more power or resources to train.
I googled DoRA training for newer models – Wan, Qwen, etc. Didn’t find anything, except a reddit post from a year ago asking pretty much exactly what I’m asking here today lol. And every comment seems to agree DoRA is superior. And Comfy has supported DoRA now for a long time.
Yet, here we are – still training LoRAs when there’s been a better option for years? This community is always fairly quick to adopt the latest and greatest. It’s odd this slipped through? I use diffusion-pipe to train pretty much everything now. I’m curious to know if theres a way I could train DoRAs with that. Or if there is a different method out there right now that is capable of training a wan DoRA.
Thanks for any insight, and curious to hear others opinions on this.
Edit: very insightful and interesting responses, my opinion has definitely shifted. @roger_ducky has a great explanation of DoRA drawbacks I was unaware of. Also cool to hear from people who had worse results than LoRA training using the same dataset/params. It sounds like sometimes LoRA is better, and sometimes DoRA is better, but DoRA is certainly not better in every instance – as I was initially led to believe. But still feels like DoRAs deserve more exploration and testing than they’ve had, especially with newer models.
submitted by /u/Realistic_Rabbit5429
[link] [comments]
The companies’ Fourth of July plans include celebrating new reactor designs coming online. But there’s…
Compression on Arrival Tool outputs should be compressed after a call returns, not after the…
I’ve been quiet since November because I’ve been building.Over the past few months, AI has…
Multi-agent LLM systems are increasingly deployed as autonomous collaborators, where agents interact freely rather than…
Editor’s Note: This is the fourth post in a series exploring how Palantir customizes infrastructure…
Authors: Lequn Wang, Jiangwei Pan, and Linas BaltrunasFigure 1. Autoregressive homepage generation. GenPage builds a…