AI/ML Techniques

How to Optimize Language Model Size for Deployment

The rise of language models, and more specifically large language models (LLMs), has been of such a magnitude that it…

12 months ago

Dealing with Missing Data Strategically: Advanced Imputation Techniques in Pandas and Scikit-learn

Missing values appear more often than not in many real-world datasets.

12 months ago

Loss Functions Explained: Understand the Maths in Just 2 Minutes Each

I must say, with the ongoing hype around machine learning, a lot of people jump straight to the application side…

12 months ago

10 MLOps Tools for Machine Learning Practitioners to Know

Machine learning is not just about building models.

1 year ago

NumPy Ninjutsu: Mastering Array Operations for High-Performance Machine Learning

Machine learning workflows typically involve plenty of numerical computations in the form of mathematical and algebraic operations upon data stored…

1 year ago

10 Python One-Liners That Will Simplify Feature Engineering

Feature engineering is a key process in most data analysis workflows, especially when constructing machine learning models.

1 year ago

Word Embeddings in Language Models

This post is divided into three parts; they are: • Understanding Word Embeddings • Using Pretrained Word Embeddings • Training…

1 year ago

A Gentle Introduction to SHAP for Tree-Based Models

Machine learning models have become increasingly sophisticated, but this complexity often comes at the cost of interpretability.

1 year ago

Using Quantized Models with Ollama for Application Development

Quantization is a frequently used strategy applied to production machine learning models, particularly large and complex ones, to make them…

1 year ago

Tokenizers in Language Models

This post is divided into five parts; they are: • Naive Tokenization • Stemming and Lemmatization • Byte-Pair Encoding (BPE)…

1 year ago