1 RAG Conceptual Diagram FINAL.max 1000x1000 1

RAG in production faster with Ray, LangChain and HuggingFace

We’re excited to announce the release of a quickstart solution and reference architecture for retrieval augmented generation (RAG) applications, designed to accelerate your journey to production. In this post, you’ll learn how to quickly deploy a complete RAG application on Google Kubernetes Engine (GKE), and Cloud SQL for PostgreSQL and pgvector, using Ray, LangChain, and …

NVIDIA AI Microservices for Drug Discovery, Digital Health Now Integrated With AWS

Harnessing optimized AI models for healthcare is easier than ever as NVIDIA NIM, a collection of cloud-native microservices, integrates with Amazon Web Services. NIM, part of the NVIDIA AI Enterprise software platform available on AWS Marketplace, enables developers to access a growing library of AI models through industry-standard application programming interfaces, or APIs. The library …

Random robots are more reliable

New algorithm encourages robots to move more randomly to collect more diverse data for learning. In tests, robots started with no knowledge and then learned and correctly performed tasks within a single attempt. New model could improve safety and practicality of self-driving cars, delivery drones and more.

When can transformers reason with abstract symbols?

We investigate the capabilities of transformer models on relational reasoning tasks. In these tasks, models are trained on a set of strings encoding abstract relations, and are then tested out-of-distribution on data that contains symbols that did not appear in the training dataset. We prove that for any relational reasoning task in a large family …

How generative AI will revolutionize supply chain 

Unlocking the full potential of supply chain management has long been a goal for businesses that seek efficiency, resilience and sustainability. In the age of digital transformation, the integration of advanced technologies like generative artificial intelligence brings a new era of innovation and optimization. AI tools help users address queries and resolve alerts by using …

marco 100

Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker

Large language models (LLMs) are making a significant impact in the realm of artificial intelligence (AI). Their impressive generative abilities have led to widespread adoption across various sectors and use cases, including content generation, sentiment analysis, chatbot development, and virtual assistant technology. Llama2 by Meta is an example of an LLM offered by AWS. Llama …

2024 Gartner CAIDS MQ graphic.max 1000x1000 1

Google is a Leader in the 2024 Gartner® Magic Quadrant™ for Cloud AI Developer Services

For the fifth consecutive year, Gartner® has named Google as a Leader in the Magic Quadrant™ for Cloud AI Developer Services (CAIDS). We believe this is a testament to our history of offering innovative AI products and delivering continuous improvement for our customers. Download the complimentary 2024 Gartner Magic Quadrant™ for Cloud AI Developer Services …