research

Creating a Llama or GPT Model for Next-Token Prediction

This article is divided into three parts; they are: • Understanding the Architecture of Llama or GPT Model • Creating…

1 month ago

Top 5 Agentic AI LLM Models

In 2025, “using AI” no longer just means chatting with a model, and you’ve probably already noticed that shift yourself.

1 month ago

Training a Tokenizer for Llama Model

Let's get started.

1 month ago

Prompt Engineering for Time Series Analysis

Strange as it may sound, large language models (LLMs) can be leveraged for data analysis tasks, including specific scenarios such…

1 month ago

The Complete Guide to Using Pydantic for Validating LLM Outputs

Large language models generate text, not structured data.

1 month ago

The Roadmap for Mastering Agentic AI in 2026

Agentic AI is changing how we interact with machines.

1 month ago

Prompt Compression for LLM Generation Optimization and Cost Reduction

Large language models (LLMs) are mainly trained to generate text responses to user queries or prompts, with complex reasoning under…

1 month ago

How to Speed-Up Training of Language Models

This article is divided into four parts; they are: • Optimizers for Training Language Models • Learning Rate Schedulers •…

2 months ago

Fine-Tuning a BERT Model

This article is divided into two parts; they are: • Fine-tuning a BERT Model for GLUE Tasks • Fine-tuning a…

2 months ago

The Journey of a Token: What Really Happens Inside a Transformer

Large language models (LLMs) are based on the transformer architecture, a complex deep neural network whose input is a sequence…

2 months ago