research

Pretraining a Llama Model on Your Local GPU

This article is divided into three parts; they are: • Training a Tokenizer with Special Tokens • Preparing the Training…

3 weeks ago

Rotary Position Embeddings for Long Context Length

This article is divided into two parts; they are: • Simple RoPE • RoPE for Long Context Length Compared to…

3 weeks ago

5 Agentic Coding Tips & Tricks

Agentic coding only feels "smart" when it ships correct diffs, passes tests, and leaves a paper trail you can trust.

3 weeks ago

How to Fine-Tune a Local Mistral or Llama 3 Model on Your Own Dataset

Large language models (LLMs) like Mistral 7B and Llama 3 8B have shaken the AI field, but their broad nature…

3 weeks ago

Top 5 Vector Databases for High-Performance LLM Applications

Building AI applications often requires searching through millions of documents, finding similar items in massive catalogs, or retrieving relevant context…

4 weeks ago

Transformer vs LSTM for Time Series: Which Works Better?

From daily weather measurements or traffic sensor readings to stock prices, time series data are present nearly everywhere.

4 weeks ago

The Machine Learning Engineer’s Checklist: Best Practices for Reliable Models

Building newly trained machine learning models that work is a relatively straightforward endeavor, thanks to mature frameworks and accessible computing…

4 weeks ago

How LLMs Choose Their Words: A Practical Walk-Through of Logits, Softmax and Sampling

This article is divided into four parts; they are: • How Logits Become Probabilities • Temperature • Top- k Sampling…

4 weeks ago

3 Feature Engineering Techniques for Unstructured Text Data

Machine learning models possess a fundamental limitation that often frustrates newcomers to natural language processing (NLP): they cannot read.

4 weeks ago

3 Subtle Ways Data Leakage Can Ruin Your Models (and How to Prevent It)

Data leakage is an often accidental problem that may happen in machine learning modeling.

4 weeks ago