| | https://huggingface.co/bytedance-research/UMO Bytedance have released 3 days ago their image editing/creation model UMO. From their huggingface description: Recent advancements in image customization exhibit a wide range of application prospects due to stronger customization capabilities. However, since we humans are more sensitive to faces, a significant challenge remains in preserving consistent identity while avoiding identity confusion with multi-reference images, limiting the identity scalability of customization models. To address this, we present UMO, a Unified Multi-identity Optimization framework, designed to maintain high-fidelity identity preservation and alleviate identity confusion with scalability. With βmulti-to-multi matchingβ paradigm, UMO reformulates multi-identity generation as a global assignment optimization problem and unleashes multi-identity consistency for existing image customization methods generally through reinforcement learning on diffusion models. To facilitate the training of UMO, we develop a scalable customization dataset with multi-reference images, consisting of both synthesised and real parts. Additionally, we propose a new metric to measure identity confusion. Extensive experiments demonstrate that UMO not only improves identity consistency significantly, but also reduces identity confusion on several image customization methods, setting a new state-of-the-art among open-source methods along the dimension of identity preserving. submitted by /u/AgeNo5351 |
This is a LoRA for FLUX Klein Base 9b. **Link: https://civitai.red/models/2381927/flux2-klein-base-9b-smartphone-snapshot-photo-reality-style** All infos on how…
This $2,000 Asus laptop delivers breathtaking performance thanks to Qualcomm's Snapdragon X2 Elite Extreme, but…
ComfyUI with ZIT submitted by /u/Ferwien [link] [comments]
Tovala is a meal kit that comes with a smart oven, or a smart oven…
Now that we have some open weight models available to us that work with character…
This paper was accepted at the Fifth Workshop on Natural Language Generation, Evaluation, and Metrics…