Data quality framework — completeness, accuracy, consistency, validation, and contracts. Use when implementing data validation, setting up quality checks for a pipeline, defining data contracts between teams, or investigating data anomalies.
Data quality is measurable across six dimensions and enforced with explicit validation rules at every pipeline boundary. Codify producer and consumer expectations as data contracts and run them as automated checks rather than relying on manual review.
Evaluate data against:
Schema validation enforces column names, types, and nullability. Range checks bound numerics (age 0–150, price >= 0). Format checks enforce ISO 8601 dates, email regexes, normalized phone numbers. Referential integrity confirms foreign keys resolve. Business rules express domain truths (order total = sum of line items). Freshness checks confirm the latest record is within the expected window. Categorize each rule by