research

Implementing Vector Search from Scratch: A Step-by-Step Tutorial

There’s no doubt that search is one of the most fundamental problems in computing.

9 months ago

How to Optimize Language Model Size for Deployment

The rise of language models, and more specifically large language models (LLMs), has been of such a magnitude that it…

9 months ago

Dealing with Missing Data Strategically: Advanced Imputation Techniques in Pandas and Scikit-learn

Missing values appear more often than not in many real-world datasets.

9 months ago

Loss Functions Explained: Understand the Maths in Just 2 Minutes Each

I must say, with the ongoing hype around machine learning, a lot of people jump straight to the application side…

9 months ago

10 MLOps Tools for Machine Learning Practitioners to Know

Machine learning is not just about building models.

9 months ago

NumPy Ninjutsu: Mastering Array Operations for High-Performance Machine Learning

Machine learning workflows typically involve plenty of numerical computations in the form of mathematical and algebraic operations upon data stored…

9 months ago

10 Python One-Liners That Will Simplify Feature Engineering

Feature engineering is a key process in most data analysis workflows, especially when constructing machine learning models.

9 months ago

Word Embeddings in Language Models

This post is divided into three parts; they are: • Understanding Word Embeddings • Using Pretrained Word Embeddings • Training…

9 months ago

A Gentle Introduction to SHAP for Tree-Based Models

Machine learning models have become increasingly sophisticated, but this complexity often comes at the cost of interpretability.

9 months ago

Using Quantized Models with Ollama for Application Development

Quantization is a frequently used strategy applied to production machine learning models, particularly large and complex ones, to make them…

9 months ago