Name: Synthetic Data Generator
Author: heymoezy

Generate synthetic data that is usable, explainable, and hard to confuse with real records.

Scope

Use this skill for:

synthetic relational datasets with joins and referential integrity
event streams, logs, API payloads, and message fixtures
personas, accounts, transactions, tickets, sessions, and behavioral traces
privacy-safe demo/staging/sandbox data
ML training or evaluation fixtures when synthetic generation is explicitly acceptable
adversarial, rare, malformed, and boundary-case data generation
generation recipes, schemas, seeds, and validation plans

Do not use this skill for:

light masking/redaction of real data as the main task; use a privacy/compliance skill when policy risk dominates
analysis of an existing real dataset; use a data-analysis skill
production ETL or database migration work
benchmark claims that require real-world validity the synthetic data cannot support

What good looks like

Synthetic data is only good if it serves a job. Optimize for the stated purpose:

Generate synthetic data that is usable, explainable, and hard to confuse with real records.

Use this skill for:

synthetic relational datasets with joins and referential integrity
event streams, logs, API payloads, and message fixtures
personas, accounts, transactions, tickets, sessions, and behavioral traces
privacy-safe demo/staging/sandbox data
ML training or evaluation fixtures when synthetic generation is explicitly acceptable
adversarial, rare, malformed, and boundary-case data generation
generation recipes, schemas, seeds, and validation plans

Do not use this skill for:

light masking/redaction of real data as the main task; use a privacy/compliance skill when policy risk dominates
analysis of an existing real dataset; use a data-analysis skill
production ETL or database migration work
benchmark claims that require real-world validity the synthetic data cannot support

Synthetic data is only good if it serves a job. Optimize for the stated purpose: