research

Train, Serve, and Deploy a Scikit-learn Model with FastAPI

FastAPI has become one of the most popular ways to serve machine learning models because it is lightweight, fast, and…

2 hours ago

AI Agent Memory Explained in 3 Levels of Difficulty

A stateless AI agent has no memory of previous calls.

1 day ago

Getting Started with Zero-Shot Text Classification

Zero-shot text classification is a way to label text without first training a classifier on your own task-specific dataset.

2 days ago

Gradient-based Planning for World Models at Longer Horizons

GRASP is a new gradient-based planner for learned dynamics (a “world model”) that makes long-horizon planning practical by (1) lifting…

2 days ago

The Complete Guide to Inference Caching in LLMs

Calling a large language model API at scale is expensive and slow.

5 days ago

Python Decorators for Production Machine Learning Engineering

You've probably written a decorator or two in your Python career.

6 days ago

Structured Outputs vs. Function Calling: Which Should Your Agent Use?

Language models (LMs), at their core, are text-in and text-out systems.

1 week ago

How to Implement Tool Calling with Gemma 4 and Python

The open-weights model ecosystem shifted recently with the release of the

1 week ago