ai/ml - Robotic Content

4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities

by AI Generated Robotic ContentFAANG October 19, 2024Comments are Disabled

*Equal Contributors Current multimodal and multitask foundation models like 4M or UnifiedIO show promising results, but in practice their out-of-the-box abilities to accept diverse inputs and perform diverse tasks are limited by the (usually rather small) number of modalities and tasks they are trained on. In this paper, we significantly expand upon the capabilities of …

Turning Conversation Into Action

by AI Generated Robotic ContentFAANG October 19, 2024Comments are Disabled

Turning Conversation Into Action (Palantir CSE #2) Anchoring AI Agents Into the Enterprise Editor’s Note: This is the second in a three-part blog series about Palantir’s AI-enabled Customer Service Engine. Part 2: Implementation In Part 1 of this three-part blog series, we explored the agentic architecture of the Customer Service Engine (CSE) through the lens of a …

Train, optimize, and deploy models on edge devices using Amazon SageMaker and Qualcomm AI Hub

by AI Generated Robotic ContentFAANG October 19, 2024Comments are Disabled

This post is co-written Rodrigo Amaral, Ashwin Murthy and Meghan Stronach from Qualcomm. In this post, we introduce an innovative solution for end-to-end model customization and deployment at the edge using Amazon SageMaker and Qualcomm AI Hub. This seamless cloud-to-edge AI development experience will enable developers to create optimized, highly performant, and custom managed machine …

Using Amazon Q Business with AWS HealthScribe to gain insights from patient consultations

by AI Generated Robotic ContentFAANG October 18, 2024Comments are Disabled

With the advent of generative AI and machine learning, new opportunities for enhancement became available for different industries and processes. During re:Invent 2023, we launched AWS HealthScribe, a HIPAA eligible service that empowers healthcare software vendors to build their clinical applications to use speech recognition and generative AI to automatically create preliminary clinician documentation. In …

Scalable Private Search with Wally

by AI Generated Robotic ContentFAANG October 17, 2024Comments are Disabled

This paper presents Wally, a private search system that supports efficient semantic and keyword search queries against large databases. When sufficiently many clients are making queries, Wally’s performance is significantly better than previous systems. In previous private search systems, for each client query, the server must perform at least one expensive cryptographic operation per database …

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

by AI Generated Robotic ContentFAANG October 17, 2024Comments are Disabled

This post was co-written with Lucas Desard, Tom Lauwers, and Sam Landuydt from DPG Media. DPG Media is a leading media company in Benelux operating multiple online platforms and TV channels. DPG Media’s VTM GO platform alone offers over 500 days of non-stop content. With a growing library of long-form video content, DPG Media recognizes …

Beyond the basics: Build real-world gen AI skills with the latest learning paths from Google Cloud

by AI Generated Robotic ContentFAANG October 17, 2024Comments are Disabled

The majority of organizations don’t feel ready for the AI era. In fact, 62% say they don’t have the expertise they need to unlock AI’s full potential.1 As the leader of learning for Google Cloud, the only thing that surprises me about that number is how low it is. I meet with customers every day, …

CAMPHOR: Collaborative Agents for Multi-Input Planning and High-Order Reasoning On Device

by AI Generated Robotic ContentFAANG October 16, 2024Comments are Disabled

While server-side Large Language Models (LLMs) demonstrate proficiency in tool integration and complex reasoning, deploying Small Language Models (SLMs) directly on devices brings opportunities to improve latency and privacy but also introduces unique challenges for accuracy and memory. We introduce CAMPHOR, an innovative on-device SLM multi-agent framework designed to handle multiple user inputs and reason …

Accelerate migration portfolio assessment using Amazon Bedrock

by AI Generated Robotic ContentFAANG October 16, 2024Comments are Disabled

Conducting assessments on application portfolios that need to be migrated to the cloud can be a lengthy endeavor. Despite the existence of AWS Application Discovery Service or the presence of some form of configuration management database (CMDB), customers still face many challenges. These include time taken for follow-up discussions with application teams to review outputs …

Founders share five takeaways from the Google Cloud Startup Summit

by AI Generated Robotic ContentFAANG October 16, 2024Comments are Disabled

We recently hosted our annual Google Cloud Startup Summit, and we were thrilled to showcase a wide range of AI startups leveraging Google Cloud, including Higgsfield AI, Click Therapeutics, Baseten, LiveX AI, Reve AI, and Vellum. As a former co-founder and venture capitalist, I was inspired by the remarkable solutions these startups are developing and …