Categories: FAANG

DataComp: In Search of the Next Generation of Multimodal Datasets

*=Equal Contributors
Multimodal datasets are a critical component in recent breakthroughs such as Stable Diffusion and GPT-4, yet their design does not receive the same research attention as model architectures or training algorithms. To address this shortcoming in the ML ecosystem, we introduce DataComp, a testbed for dataset experiments centered around a new candidate pool of 12.8 billion image-text pairs from Common Crawl. Participants in our benchmark design new filtering techniques or curate new data sources and then evaluate their new dataset by running our standardized CLIP training…
AI Generated Robotic Content

Recent Posts

Anima-Base is magic and i don’t think people realize how good it is.

I made a post about ZIT earlier this month, but i think its time ANIMA…

6 hours ago

Technical deep dive: AgentCore payments and innovation in agentic commerce

The industry is entering a world where billions of generative AI agents operate autonomously, acting…

6 hours ago

Pope Leo Schooled the Tech Bros on Tolkien

The Holy Father referenced The Lord of the Rings in his encyclical about AI—an expert…

7 hours ago

AI beats human forecasters in tournament predicting 30 tech ventures

For decades, the idea that artificial intelligence can beat humans at number-crunching tasks like high-frequency…

7 hours ago

Testing ZIT and Flux-1 with “NVIDIA PiD — Pixel Diffusion Decoder”

Just tested NVIDIA-PiD with 512px generated images and 1024 generated image downscaled to 512, because…

1 day ago