Categories: Image

RecA: A new finetuning method that doesn’t use image captions.

WamVBXxgyuIZPCpGXiKQZANorIFYKFLmI398vBAa Cs

“We introduce Reconstruction Alignment (RecA), a resource-efficient post-training method that leverages visual understanding encoder embeddings as dense “text prompts,” providing rich supervision without captions. Concretely, RecA conditions a UMM on its own visual understanding embeddings and optimizes it to reconstruct the input image with a self-supervised reconstruction loss, thereby realigning understanding and generation.”

https://huggingface.co/sanaka87/BAGEL-RecA

submitted by /u/Total-Resort-3120
[link] [comments]

Unlock multimodal search at scale: Combine text & image power with Vertex AI

January 15, 2025

In "FAANG"

Building In-Video Search

Boris Chen, Ben Klein, Jason Ge, Avneesh Saluja, Guru Tahasildar, Abhishek Soni, Juan Vimberg, Elliot Chow, Amir Ziai, Varun Sekhri, Santiago Castro, Keila Fong, Kelli Griggs, Mallia Sherzai, Robert Mayer, Andy Yao, Vi Iyengar, Jonathan Solorzano-Hamilton, Hossein Taghavi, Ritwik KumarIntroductionToday we’re going to take a look at the behind the scenes…

November 7, 2023

In "FAANG"