The Problem: "Works on My Machine"

You've built a fantastic ML application. It runs perfectly on your laptop. You give it to your colleague or deploy it to a server, and... it crashes. Why? Maybe they have a different Python version, a different version of TensorFlow, or a missing system library. This is a classic deployment headache.

The Solution: Docker

Docker solves this by allowing you to package your application and all its dependencies—the code, a specific Python version, all the necessary libraries, and even the model file—into a single, self-contained, and portable unit called a container.

Analogy: A Docker container is like a standardized shipping container for software. It doesn't matter what's inside; the container itself can be moved, loaded, and run on any machine that has Docker installed (your laptop, a colleague's Mac, a cloud server) and it will behave exactly the same way every time.

Creating a Dockerfile

A Dockerfile is a text file that contains a set of instructions for building a Docker image. It's the recipe for your container.

Let's create a Dockerfile for the FastAPI application from the previous tutorial.

requirements.txt:

fastapi
uvicorn
scikit-learn
joblib
pandas
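One refinement worth considering, in the spirit of reproducible builds: pinning exact versions in requirements.txt ensures the image installs the same packages every time it is built, on any machine. The version numbers below are illustrative placeholders, not the versions this tutorial was written against; pin whatever versions you have tested locally (pip freeze will list them):

```
fastapi==0.110.0
uvicorn==0.29.0
scikit-learn==1.4.2
joblib==1.4.0
pandas==2.2.2
```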

Dockerfile (A simple, single-stage build):

# 1. Start from an official Python base image
FROM python:3.9-slim

# 2. Set the working directory inside the container
WORKDIR /app

# 3. Copy the dependencies file
COPY requirements.txt .

# 4. Install the dependencies
RUN pip install --no-cache-dir -r requirements.txt

# 5. Copy the rest of the application code and model file
COPY . .

# 6. Expose the port the app runs on
EXPOSE 8000

# 7. Define the command to run the application
CMD ["uvicorn", "app:app", "--host", "0.0.0.0", "--port", "8000"]

You would then build and run this container with the following commands:

docker build -t sentiment-api .
docker run -p 8000:8000 sentiment-api
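One caveat about the COPY . . instruction: it copies everything in the build context into the image, including local virtual environments, caches, and the .git directory if present. A .dockerignore file in the same directory as the Dockerfile (read automatically by docker build) keeps that clutter out of the image. The entries below are a typical sketch; adjust them to match your project:

```
.git
__pycache__/
*.pyc
venv/
.env
```

Excluding files like .env also prevents local secrets from being baked into the image by accident.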

The Problem: Bloated Docker Images

The Dockerfile above works, but the resulting image is often larger than it needs to be. Everything that happens during the build ends up in the final image: pip's build artifacts and any files swept up by COPY . . . Worse, some packages need compilers and development headers to install at all, which forces you onto a full base image like python:3.9 instead of the slim variant; those build tools are required to install the packages but never to run the application. Large images are slower to download and deploy, and they expose a larger security attack surface.

The Solution: Multi-Stage Builds

A multi-stage build uses multiple FROM instructions in a single Dockerfile. Each FROM instruction begins a new "stage." You can selectively copy artifacts (like installed packages or compiled code) from one stage to another, leaving behind everything you don't need in the final image.

Code Snippet: An Optimized Dockerfile with Multi-Stage Build

# --- Stage 1: The "Builder" ---
# Use a full-featured image to install dependencies
FROM python:3.9 AS builder

WORKDIR /app

# Install dependencies in a virtual environment
RUN python -m venv /opt/venv
ENV PATH="/opt/venv/bin:$PATH"

COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt


# --- Stage 2: The "Runner" ---
# Use a minimal, slim image for the final container
FROM python:3.9-slim

WORKDIR /app

# The magic step: Copy ONLY the installed packages from the builder stage
COPY --from=builder /opt/venv /opt/venv

# Copy the application code and model
COPY . .

# Set the path to use the virtual environment
ENV PATH="/opt/venv/bin:$PATH"

EXPOSE 8000

# Run the application
CMD ["uvicorn", "app:app", "--host", "0.0.0.0", "--port", "8000"]

This approach creates a much smaller and more secure final image because it doesn't contain any of the build-time tools from the first stage. It only contains the slim Python runtime, your application code, and the necessary installed libraries.
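To push the security point a little further, a common hardening step is to run the container as a non-root user, so a compromised process has fewer privileges. This is a sketch of the extra lines you would add near the end of the runner stage; the user name appuser is an arbitrary choice, and useradd is available in the Debian-based python:3.9-slim image:

```dockerfile
# Create an unprivileged user and switch to it
RUN useradd --create-home appuser
USER appuser
```

Any instruction after USER (including the CMD) then runs as that user rather than as root.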