ML Pipeline Design

Intro

A production ML pipeline runs from raw data to a monitored model in production. The non-negotiable property is reproducibility: given the same data version, feature config, and hyperparameters, you get the same model. Without that, debugging is impossible six months later.

Overview

Pipeline stages

Data collection -> Data versioning -> Feature engineering -> Training
    -> Evaluation -> Model registry -> Deployment -> Monitoring

Every stage has to be reproducible and every artifact has to be traceable back to its inputs. See references/pipeline-stages.md for stage-by-stage tools, best practices, and pitfalls.

Data versioning

Track datasets like code. Every training run must reference an exact data version.

ML Pipeline Design

Intro

Overview

Pipeline stages

Data collection -> Data versioning -> Feature engineering -> Training
    -> Evaluation -> Model registry -> Deployment -> Monitoring

Every stage has to be reproducible and every artifact has to be traceable back to its inputs. See references/pipeline-stages.md for stage-by-stage tools, best practices, and pitfalls.

Data versioning

Track datasets like code. Every training run must reference an exact data version.

Pattern	Description	Use when
Shadow	New model runs alongside old, results compared but not served	Validating with real traffic, no user impact
Canary	New model serves 5–10% of traffic, ramping up	Limiting blast radius
A/B test	Two models serve different segments, measure business metrics	Comparing on real outcomes
Blue-green	Two identical environments, switch traffic at once	Need instant rollback
Feature flag	Model version gated by flag system	Gradual rollout, easy kill switch

Tool	Type	Best for
Apache Airflow	General DAG orchestration	Complex workflows, many integrations
Kubeflow Pipelines	K8s-native ML pipelines	Teams already on Kubernetes
Vertex AI Pipelines	Managed (GCP)	GCP-native teams, minimal ops
SageMaker Pipelines	Managed (AWS)	AWS-native teams, minimal ops
Prefect / Dagster	Modern Python orchestration	Python-first teams, better DX than Airflow
ZenML	ML-specific orchestration	Teams wanting an ML pipeline abstraction

Ml Pipeline

ML Pipeline Design

Intro

Overview

Pipeline stages

Data versioning

Ml Pipeline

ML Pipeline Design

Intro

Overview

Pipeline stages

Data versioning

Feature engineering and feature stores

Experiment tracking

Model registry

Deployment patterns

Monitoring and drift

CI/CD for ML

Gotchas

Full reference

Pipeline orchestration tools

Stage-by-stage gotchas

Anti-patterns

Worked scenarios

Continuous Learning V2

Continuous Learning V2

Continuous Learning V2

Continuous Learning

Continuous Learning

Pytorch Patterns