Text Generation with GPT-2 Model

This tutorial is in four parts; they are: • The Core Text Generation Implementation • Contrastive Search: What are the Parameters in Text Generation? • Batch Processing and Padding • Tips for Better Generation Results Let’s start with a basic implementation that demonstrates the fundamental concept.

Picture1 8

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

Increasingly, organizations across industries are turning to generative AI foundation models (FMs) to enhance their applications. To achieve optimal performance for specific use cases, customers are adopting and adapting these FMs to their unique domain requirements. This need for customization has become even more pronounced with the emergence of new models, such as those released …

image1 zd4iHbn.max 1000x1000 1

Use Gemini 2.0 to speed up document extraction and lower costs

A few weeks ago, Google DeepMind released Gemini 2.0 for everyone, including Gemini 2.0 Flash, Gemini 2.0 Flash-Lite, and Gemini 2.0 Pro (Experimental). All models support up to at least 1 million input tokens, which makes it easier to do a lot of things – from image generation to creative writing. It’s also changed how …