Fine-tune Meta Llama 3.2 text generation models for generative AI inference using Amazon SageMaker JumpStart

Generative AI models have seen tremendous growth, offering cutting-edge solutions for text generation, summarization, code generation, and question answering. Despite their versatility, these models often struggle when applied to niche or domain-specific tasks because their pre-training is typically based on large, generalized datasets. To address these gaps and maximize their utility in specialized scenarios, fine-tuning …

Generative AI with enterprise controls for business users in 24 Hours

Aible is a leader in generating business impact from AI in less than 30 days, helping teams use AI to extract enterprise value from raw enterprise data with solutions for customer acquisition, churn prevention, demand prediction, preventative maintenance, and more. After previously leveraging BigQuery’s serverless architecture to reduce analytics costs, Aible is now collaborating with …

Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval

Neural contextual biasing allows speech recognition models to leverage contextually relevant information, leading to improved transcription accuracy. However, the biasing mechanism is typically based on a cross-attention module between the audio and a catalogue of biasing entries, which means computational complexity can pose severe practical limitations on the size of the biasing catalogue and consequently …

Amazon Bedrock Prompt Management is now available in GA

Today we are announcing the general availability of Amazon Bedrock Prompt Management, with new features that provide enhanced options for configuring your prompts and enabling seamless integration for invoking them in your generative AI applications. Amazon Bedrock Prompt Management simplifies the creation, evaluation, versioning, and sharing of prompts to help developers and prompt engineers get …

Getting started with NL2SQL (natural language to SQL) with Gemini and BigQuery

Combining Natural Language Processing (NLP) with traditional Structured Query Language (SQL) has given rise to an exciting new technology known as Natural Language to SQL, or NL2SQL, which translates questions phrased in everyday human language into structured SQL queries. Not surprisingly, the technology has tremendous potential to transform the way we interact …

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation

This post is co-written with Steven Craig from Hearst. To maintain their competitive edge, organizations are constantly seeking ways to accelerate cloud adoption, streamline processes, and drive innovation. However, Cloud Center of Excellence (CCoE) teams can often be perceived as bottlenecks to organizational transformation due to limited resources and overwhelming demand for their support. In …

Can AI eliminate manual processing for insurance claims? Loadsure built a solution to find out

Traditionally, insurance claims processing has been a labor-intensive and time-consuming process, often involving manual verification of documents and data entry. This can lead to delays in claim settlements and a frustrating experience for policyholders. Loadsure, a global Insurtech firm based in London, recognized the need to address these challenges and sought a solution that would …

On Device Llama 3.1 with Core ML

Many app developers are interested in building on-device experiences that integrate increasingly capable large language models (LLMs). Running these models locally on Apple silicon enables developers to leverage the capabilities of the user’s device for cost-effective inference, without sending data to and from third-party servers, which also helps protect user privacy. In order …