FAANG

Vertex AI grounding: More reliable models, fewer hallucinations

At the Gemini for Work event in September, we showcased how generative AI is transforming the way enterprises work. Across…

3 weeks ago

Cohere Rerank 3.5 is now available in Amazon Bedrock through Rerank API

We are excited to announce the availability of Cohere’s advanced reranking model Rerank 3.5 through our new Rerank API in…

3 weeks ago

Easily deploy and manage hundreds of LoRA adapters with SageMaker efficient multi-adapter inference

The new efficient multi-adapter inference feature of Amazon SageMaker unlocks exciting possibilities for customers using fine-tuned models. This capability integrates…

3 weeks ago

Search enterprise data assets using LLMs backed by knowledge graphs

Enterprises are facing challenges in accessing their data assets scattered across various sources because of increasing complexities in managing vast…

4 weeks ago

Speech is More Than Words: Do Speech-to-Text Translation Systems Leverage Prosody?

This paper was accepted at the Ninth Conference on Machine Translation (WMT24) at EMNLP 2024. The prosody of a spoken…

4 weeks ago

Ethical AI in Defense Decision Support Systems

(Defense AI Ethics, #2) Editor's Note: In a previous post in our series on ethical…

4 weeks ago

Unleash your Salesforce data using the Amazon Q Salesforce Online connector

Thousands of companies worldwide use Salesforce to manage their sales, marketing, customer service, and other business operations. The Salesforce cloud-based…

4 weeks ago

Deploy Meta Llama 3.1 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

We’re excited to announce the availability of Meta Llama 3.1 8B and 70B inference support on AWS Trainium and AWS…

4 weeks ago

Multimodal Autoregressive Pre-Training of Large Vision Encoders

A dominant paradigm in large multimodal models is to pair a large language decoder with a vision…

1 month ago

Accelerating Mixtral MoE fine-tuning on Amazon SageMaker with QLoRA

Companies across various scales and industries are using large language models (LLMs) to develop generative AI applications that provide innovative…

1 month ago