faang

How LotteON built a personalized recommendation system using Amazon SageMaker and MLOps

This post is co-written with HyeKyung Yang, Jieun Lim, and SeungBum Shim from LotteON. LotteON aims to be a platform…

2 years ago

To tune or not to tune? A guide to leveraging your data with LLMs

Customers tell us they see great potential in using large language models (LLMs) with their data to improve customer experiences,…

2 years ago

AI on Air: Exploring GPT-4o

DALL-E illustration showcasing my audio demos and conversations with GPT-4oThis week, OpenAI announced the release of GPT-4o, the latest iteration of…

2 years ago

KV-Runahead: Scalable Causal LLM Inference by Parallel Key-Value Cache Generation

Large Language Model or LLM inference has two phases, the prompt (or prefill) phase to output the first token and…

2 years ago

The power of remote engine execution for ETL/ELT data pipelines

Business leaders risk compromising their competitive edge if they do not proactively implement generative AI (gen AI). However, businesses scaling AI face…

2 years ago

Build a serverless exam generator application from your own lecture content using Amazon Bedrock

Crafting new questions for exams and quizzes can be tedious and time-consuming for educators. The time required varies based on…

2 years ago

Announcing general availability of Ray on Vertex AI

Developers and engineers face several major challenges when scaling AI/ML workloads. One challenge is getting access to the AI infrastructure…

2 years ago

Needle-Moving AI Research Trains Surgical Robots in Simulation

A collaboration between NVIDIA and academic researchers is prepping robots for surgery. ORBIT-Surgical — developed by researchers from the University…

2 years ago

OpenAI Created Her: The Birth of GPT-4o

Image generated with Midjourney. In a groundbreaking move, OpenAI has unveiled GPT-4o, a revolutionary model that marks a significant leap…

2 years ago

Gemini breaks new ground: a faster model, longer context and AI agents

We’re introducing a series of updates across the Gemini family of models, including the new 1.5 Flash, our lightweight model…

2 years ago