4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities

Current multimodal and multitask foundation models like 4M or UnifiedIO show promising results, but in practice their out-of-the-box abilities to accept diverse inputs and perform diverse tasks are limited by the (usually rather small) number of modalities and tasks they are trained on. In this paper, we significantly expand upon the capabilities of 4M by training it on tens of highly diverse modalities and by performing co-training on large-scale multimodal datasets and text corpora. This includes training on several semantic and geometric modalities, feature maps from…
