A hands-on walkthrough of the synthdata skills. The tutorial covers five core skills hands-on (generate, extract, extend, anonymize, serve) and introduces the compute and prompt-builder skills at the end. Rather than explaining features abstractly, the user works with real datasets as they go — each step producing a file they can inspect. The arc: tour templates → generate from template → write a custom schema → extract → extend → anonymize → serve as MCP server.

Prerequisites

Check that the Python dependencies are installed:

python3 -c "import openpyxl, faker, numpy, pandas, yaml, mcp" 2>&1 && echo "DEPS_READY" || echo "DEPS_MISSING"

If DEPS_MISSING, tell the user to run:

pip install openpyxl faker numpy pandas pyyaml mcp --break-system-packages

Pick a scratch directory for tutorial outputs (default: /tmp/synthdata-tutorial/). Create it with before Step 1.

Prerequisites

Check that the Python dependencies are installed:

python3 -c "import openpyxl, faker, numpy, pandas, yaml, mcp" 2>&1 && echo "DEPS_READY" || echo "DEPS_MISSING"

If DEPS_MISSING, tell the user to run:

pip install openpyxl faker numpy pandas pyyaml mcp --break-system-packages

Pick a scratch directory for tutorial outputs (default: /tmp/synthdata-tutorial/). Create it with before Step 1.

Synthdata Interactive Tutorial

Prerequisites

Synthdata Interactive Tutorial

Prerequisites

Starting the Tutorial

The Tutorial Arc

Step 1: Tour the templates — "What can I generate out of the box?"

Step 2: Custom schema — "Write your own YAML"

Taskflow Inbox Triage

Accessibility

Open a Pull Request

Investor Materials

Continuous Agent Loop

Configure Ecc