Overview

Design data models that are normalized, performant, scalable, and aligned with business requirements. Good data models are the foundation of reliable systems.

Core principle: Data outlives code - design for the long term.

When to Use

Always:

Creating new database schemas
Adding new entities
Modifying existing models
Designing for scale
Handling complex relationships

Never skip:

"We'll use NoSQL and figure it out later"
"Just add a column"
"Denormalize everything for speed"

The Data Model Cycle

digraph data_model {
    rankdir=LR;
    requirements [label="REQUIREMENTS\nDefine data needs", shape=box, style=filled, fillcolor="#ffcccc"];
    entities [label="ENTITIES\nIdentify objects", shape=box, style=filled, fillcolor="#ccffcc"];
    relationships [label="RELATIONSHIPS\nDefine connections", shape=box, style=filled, fillcolor="#ccccff"];
    normalize [label="NORMALIZE\nReduce redundancy", shape=box, style=filled, fillcolor="#ffffcc"];
    optimize [label="OPTIMIZE\nAdd performance", shape=box, style=filled, fillcolor="#ffcc99"];
    validate [label="VALIDATE\nReview design", shape=box, style=filled, fillcolor="#99ff99"];

    requirements -> entities -> relationships -> normalize -> optimize -> validate;
    validate -> entities [label="iterate"];
}

Overview

Design data models that are normalized, performant, scalable, and aligned with business requirements. Good data models are the foundation of reliable systems.

Core principle: Data outlives code - design for the long term.

When to Use

Always:

Creating new database schemas
Adding new entities
Modifying existing models
Designing for scale
Handling complex relationships

Never skip:

"We'll use NoSQL and figure it out later"
"Just add a column"
"Denormalize everything for speed"

The Data Model Cycle

digraph data_model {
    rankdir=LR;
    requirements [label="REQUIREMENTS\nDefine data needs", shape=box, style=filled, fillcolor="#ffcccc"];
    entities [label="ENTITIES\nIdentify objects", shape=box, style=filled, fillcolor="#ccffcc"];
    relationships [label="RELATIONSHIPS\nDefine connections", shape=box, style=filled, fillcolor="#ccccff"];
    normalize [label="NORMALIZE\nReduce redundancy", shape=box, style=filled, fillcolor="#ffffcc"];
    optimize [label="OPTIMIZE\nAdd performance", shape=box, style=filled, fillcolor="#ffcc99"];
    validate [label="VALIDATE\nReview design", shape=box, style=filled, fillcolor="#99ff99"];

    requirements -> entities -> relationships -> normalize -> optimize -> validate;
    validate -> entities [label="iterate"];
}

Data Model Design

Overview

When to Use

The Data Model Cycle

Data Model Design

Overview

When to Use

The Data Model Cycle

Database Selection

Relational (SQL)

Document (NoSQL)

Key-Value

Column-Family

Graph

Search

Entity Design

Naming Conventions

Standard Fields

Data Types

Relationship Design

One-to-Many

Many-to-Many

One-to-One

Self-Referencing

Normalization

First Normal Form (1NF)

Second Normal Form (2NF)

Third Normal Form (3NF)

When to Denormalize

Indexing Strategy

Primary Keys

Foreign Keys

Search Fields

Composite Indexes

Partial Indexes

Covering Indexes (INCLUDE)

Soft Deletes

Data Integrity

Constraints

Transactions

Schema Versioning

Migration Strategy

Backward Compatibility

Scaling Strategies

Read Replicas

Sharding

Partitioning (PostgreSQL)

Archiving

Data Model Review Checklist

Structure

Performance

Integrity

Standards

Evolution

Common Patterns

Audit Trail

Multi-tenancy

Time-Series

Best Practices

Do's

Don'ts

Integration with AI-DLC

Inception Phase

Construction Phase

Operations Phase

Final Rule

Clickhouse Io

Clickhouse Io

Claude Devfleet

Clickhouse Io

Ai First Engineering

Postgres Patterns