research

Dealing with Missing Data Strategically: Advanced Imputation Techniques in Pandas and Scikit-learn

Missing values appear more often than not in many real-world datasets.

8 months ago

Loss Functions Explained: Understand the Maths in Just 2 Minutes Each

I must say, with the ongoing hype around machine learning, a lot of people jump straight to the application side…

8 months ago

10 MLOps Tools for Machine Learning Practitioners to Know

Machine learning is not just about building models.

8 months ago

NumPy Ninjutsu: Mastering Array Operations for High-Performance Machine Learning

Machine learning workflows typically involve plenty of numerical computations in the form of mathematical and algebraic operations upon data stored…

8 months ago

10 Python One-Liners That Will Simplify Feature Engineering

Feature engineering is a key process in most data analysis workflows, especially when constructing machine learning models.

8 months ago

Word Embeddings in Language Models

This post is divided into three parts; they are: • Understanding Word Embeddings • Using Pretrained Word Embeddings • Training…

8 months ago

A Gentle Introduction to SHAP for Tree-Based Models

Machine learning models have become increasingly sophisticated, but this complexity often comes at the cost of interpretability.

8 months ago

Using Quantized Models with Ollama for Application Development

Quantization is a frequently used strategy applied to production machine learning models, particularly large and complex ones, to make them…

8 months ago

Tokenizers in Language Models

This post is divided into five parts; they are: • Naive Tokenization • Stemming and Lemmatization • Byte-Pair Encoding (BPE)…

8 months ago

10 Python Libraries That Speed Up Model Development

Machine learning model development often feels like navigating a maze, exciting but filled with twists, dead ends, and time sinks.

8 months ago