DataComp: In Search of the Next Generation of Multimodal Datasets

Multimodal datasets are a critical component in recent breakthroughs such as Stable Diffusion and GPT-4, yet their design does not receive the same research attention as model architectures or training algorithms. To address this shortcoming in the ML ecosystem, we introduce DataComp, a testbed for dataset experiments centered around a new candidate pool of 12.8 billion image-text pairs from Common Crawl. Participants in our benchmark design new filtering techniques or curate new data sources and then evaluate their new dataset by running our standardized CLIP training…