Categories: FAANG

DataComp: In Search of the Next Generation of Multimodal Datasets

*=Equal Contributors
Multimodal datasets are a critical component in recent breakthroughs such as Stable Diffusion and GPT-4, yet their design does not receive the same research attention as model architectures or training algorithms. To address this shortcoming in the ML ecosystem, we introduce DataComp, a testbed for dataset experiments centered around a new candidate pool of 12.8 billion image-text pairs from Common Crawl. Participants in our benchmark design new filtering techniques or curate new data sources and then evaluate their new dataset by running our standardized CLIP training…
AI Generated Robotic Content

Recent Posts

SpecMD: A Comprehensive Study on Speculative Expert Prefetching

Mixture-of-Experts (MoE) models enable sparse expert activation, meaning that only a subset of the model’s…

4 hours ago

Cost effective deployment of vision-language models for pet behavior detection on AWS Inferentia2

Tomofun, the Taiwan-headquartered pet-tech startup behind the Furbo Pet Camera, is redefining how pet owners…

4 hours ago

Pioneering AI-assisted code migration: How Google achieved 6x faster migration from TensorFlow to JAX

AI coding agents are rapidly becoming ubiquitous across the software industry, fundamentally changing how developers…

4 hours ago

Elon Musk’s Last-Ditch Effort to Control OpenAI: Recruit Sam Altman to Tesla

Messages between Shivon Zilis and Tesla executives reveal plans in 2017 to start a rival…

5 hours ago

AI training method helps robots carry lab-learned skills into real-world tasks

Robots are trained for specific tasks, such as cutting, using simulation. However, collecting real-world data…

5 hours ago