DynamoDB data modeling and design patterns for AWS applications. Use when: DynamoDB table design, single-table design, DynamoDB GSI, partition key strategy, sort key design, DynamoDB query optimization, NoSQL data modeling, DynamoDB streams, DynamoDB TTL, DynamoDB transactions, batch operations, DAX caching, capacity planning, access pattern modeling, DynamoDB CDC, item collection, secondary index design, write sharding. Do NOT use when: relational database design, SQL queries, PostgreSQL schema, MySQL optimization, MongoDB queries, Redis caching, Elasticsearch indexing, general SQL joins, Oracle tuning, Cassandra ring design.
Store multiple entity types in one table using generic key names (PK, SK). Prefix values with entity type for disambiguation.
| PK | SK | Entity |
|---|---|---|
| USER#u123 | METADATA | User profile |
| USER#u123 | ORDER#2024-03-15#o456 | User's order |
| USER#u123 | ORDER#2024-03-15#o456#ITEM#i789 | Order item |
| ORG#org1 | USER#u123 | Org membership |
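To sketch why this layout pays off: a single Query on the partition key returns the profile row and every order row together. The request shape below uses the low-level API; the table name `app` is an assumption.

```python
# Build a low-level Query request that fetches a user's profile row plus
# all of their ORDER# rows in one round trip -- they share one partition.
def user_item_collection_request(user_id: str) -> dict:
    return {
        "TableName": "app",  # assumed table name
        "KeyConditionExpression": "PK = :pk",
        "ExpressionAttributeValues": {":pk": {"S": f"USER#{user_id}"}},
    }

req = user_item_collection_request("u123")
# With boto3: boto3.client("dynamodb").query(**req)
```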
Use single-table when:

- Access patterns are known up front and related entities are fetched together
- You want to retrieve multiple entity types in a single Query
- Operational simplicity matters (one table to provision, monitor, and back up)

Use multi-table when:

- Access patterns are still evolving or genuinely independent per entity
- Entities need very different capacity, TTL, Stream, or backup settings
- Schema discoverability for a broad team outweighs query efficiency
| Pattern | PK Example | Use Case |
|---|---|---|
| Entity ID | USER#u123 | Direct lookups |
| Composite | TENANT#t1#USER#u123 | Multi-tenant isolation |
| Write sharding | VOTES#item1#3 (append 0-N) | Hot partition mitigation |
| Time-bucketed | LOGS#2024-03-15 | Time-series with known ranges |
When a single key receives disproportionate traffic, append a random suffix:
```python
import random

SHARD_COUNT = 10
pk = f"COUNTER#{item_id}#{random.randint(0, SHARD_COUNT - 1)}"
# Read: query all shards and aggregate
```
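Reading the counter back means querying every shard and summing the results; a sketch, assuming the shard layout above and an attribute named `count`:

```python
# Enumerate all shard keys for a sharded counter, then sum the counts
# returned by querying each shard.
SHARD_COUNT = 10

def shard_pks(item_id: str) -> list:
    return [f"COUNTER#{item_id}#{n}" for n in range(SHARD_COUNT)]

def total(shard_items: list) -> int:
    # shard_items: the concatenated Items lists from each shard's Query
    return sum(int(item["count"]["N"]) for item in shard_items)
```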
Sort keys enable range queries, hierarchical data, and version tracking within a partition.
| Pattern | SK Example | Enables |
|---|---|---|
| Hierarchical | COUNTRY#US#STATE#CA#CITY#LA | begins_with at any level |
| Timestamp | ORDER#2024-03-15T10:30:00Z | Range queries on time |
| Version | v0 (current), v1, v2 | Version history |
| Composite | STATUS#active#DATE#2024-03-15 | Filter by status + time |
| Zero-padded | RANK#000042 | Numeric sort as strings |
Sort key condition operators:

- = — exact match
- <, <=, >, >= — range
- BETWEEN — inclusive range
- begins_with — prefix matching (most powerful for hierarchies)

GSIs project data into a separate partition structure with different PK/SK. They consume their own capacity and replicate asynchronously.
Reuse a single GSI for multiple access patterns by overloading its key semantics:
| GSI1PK | GSI1SK | Use |
|---|---|---|
| [email protected] | USER | Lookup user by email |
| ORG#org1 | USER#u123 | List users in org |
| STATUS#active | DATE#2024-03-15 | Active items by date |
Create a GSI with SK as its PK and PK as its SK to reverse query direction:
Table: PK=USER#u123, SK=ORDER#o456
GSI: PK=ORDER#o456, SK=USER#u123 → "which user placed this order?"
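A sketch of the reverse lookup as a low-level Query request; the table name `app` and index name `InvertedIndex` are assumptions:

```python
# Query the inverted index to find the user partition that owns an order.
# The GSI's partition key is the base table's SK attribute.
def order_owner_request(order_id: str) -> dict:
    return {
        "TableName": "app",            # assumed table name
        "IndexName": "InvertedIndex",  # assumed index name
        "KeyConditionExpression": "SK = :order",
        "ExpressionAttributeValues": {":order": {"S": f"ORDER#{order_id}"}},
    }
```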
| Access Pattern | Key Design |
|---|---|
| Get user profile | PK=USER#u123, SK=METADATA |
| List user orders (newest first) | PK=USER#u123, SK=begins_with("ORDER#"), ScanIndexForward=false |
| Get order details + items | PK=ORDER#o456, no SK condition (returns the whole item collection) |
| Orders by status | GSI1PK=STATUS#shipped, GSI1SK=DATE#2024-03-15 |
| Lookup by email | GSI2PK=[email protected], GSI2SK=USER |
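The newest-first order listing from the table above, as a request sketch (table name `app` assumed). ISO-8601 dates in the sort key sort lexicographically, so ScanIndexForward=False yields the most recent orders first:

```python
# Query a user's orders in descending sort-key (newest-first) order.
def recent_orders_request(user_id: str, limit: int = 10) -> dict:
    return {
        "TableName": "app",  # assumed table name
        "KeyConditionExpression": "PK = :pk AND begins_with(SK, :prefix)",
        "ExpressionAttributeValues": {
            ":pk": {"S": f"USER#{user_id}"},
            ":prefix": {"S": "ORDER#"},
        },
        "ScanIndexForward": False,  # descending sort-key order
        "Limit": limit,
    }
```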
Streams capture item-level changes in time order. Each record contains the key and optionally old/new images.
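A minimal consumer sketch (Lambda-style event shape, assuming the stream is configured with NEW_AND_OLD_IMAGES):

```python
# Collect the attribute names that changed in each MODIFY record.
def changed_attributes(event: dict) -> list:
    changes = []
    for record in event.get("Records", []):
        if record["eventName"] != "MODIFY":
            continue
        old = record["dynamodb"]["OldImage"]
        new = record["dynamodb"]["NewImage"]
        changes.append(sorted(k for k in new if new[k] != old.get(k)))
    return changes
```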
Stream view types:

- KEYS_ONLY — only key attributes (cheapest)
- NEW_IMAGE — full item after modification
- OLD_IMAGE — full item before modification
- NEW_AND_OLD_IMAGES — both (most expensive, needed for diffs)

Set a numeric attribute (epoch seconds) as the TTL attribute. DynamoDB deletes expired items automatically at no write cost.
```python
import time

# Session expiry (24 hours)
item["ttl"] = int(time.time()) + 86400

# Soft delete: set TTL to 30 days, archive via Stream before deletion
item["ttl"] = int(time.time()) + (30 * 86400)
item["deleted"] = True

# Rolling window: keep only last 90 days of events
event["ttl"] = int(time.time()) + (90 * 86400)
```
TTL deletion is lazy (expired items can linger for up to ~48 hours), so exclude them on read with FilterExpression: ttl > :now.

TransactWriteItems and TransactGetItems provide ACID across up to 100 items / 4 MB.
```python
client.transact_write_items(
    TransactItems=[
        {
            "Put": {
                "TableName": "app",
                # plus the order's remaining attributes
                "Item": {"PK": {"S": "ORDER#o789"}, "SK": {"S": "METADATA"}},
                "ConditionExpression": "attribute_not_exists(PK)",  # idempotency
            }
        },
        {
            "Update": {
                "TableName": "app",
                "Key": {"PK": {"S": "USER#u123"}, "SK": {"S": "METADATA"}},
                "UpdateExpression": "SET orderCount = orderCount + :one",
                "ExpressionAttributeValues": {":one": {"N": "1"}},
            }
        },
    ]
)
```
Batch operations return partial failures that must be retried:

- BatchWriteItem: retry UnprocessedItems in the response with exponential backoff
- BatchGetItem: UnprocessedKeys if throttled — retry with backoff

```python
import time

def batch_write_with_retry(client, request_items, max_retries=5):
    # request_items uses the BatchWriteItem shape:
    # {"TableName": [{"PutRequest": {"Item": {...}}}, ...]}
    unprocessed = request_items
    for attempt in range(max_retries):
        response = client.batch_write_item(RequestItems=unprocessed)
        unprocessed = response.get("UnprocessedItems", {})
        if not unprocessed:
            return
        time.sleep(2 ** attempt * 0.1)  # exponential backoff
    raise Exception("Failed to process all items")
```
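BatchGetItem needs the same treatment for UnprocessedKeys; a sketch against a boto3-style low-level client (the `batch_get_item` request/response shapes follow the DynamoDB API):

```python
import time

def batch_get_with_retry(client, request_items, max_retries=5):
    # request_items uses the BatchGetItem shape:
    # {"TableName": {"Keys": [{"PK": {...}, "SK": {...}}, ...]}}
    items, pending = [], request_items
    for attempt in range(max_retries):
        response = client.batch_get_item(RequestItems=pending)
        for table_items in response.get("Responses", {}).values():
            items.extend(table_items)
        pending = response.get("UnprocessedKeys", {})
        if not pending:
            return items
        time.sleep(2 ** attempt * 0.1)  # exponential backoff
    raise Exception("Failed to fetch all items")
```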
An in-memory, write-through cache that sits between your app and DynamoDB, serving eventually consistent reads with microsecond latency (strongly consistent reads pass through to the table).
```python
import amazondax

dax_client = amazondax.AmazonDaxClient(
    endpoints=["dax-cluster.abc123.dax-clusters.us-east-1.amazonaws.com:8111"]
)
response = dax_client.get_item(
    TableName="app",
    Key={"PK": {"S": "USER#u123"}, "SK": {"S": "METADATA"}},
)
```
| Anti-Pattern | Problem | Fix |
|---|---|---|
| Scan for queries | O(n) cost, reads entire table | Use Query with proper key design |
| Low-cardinality PK | Hot partitions, throttling | Use high-cardinality keys, add sharding |
| Read-before-write | 2x capacity, race conditions | Use ConditionExpression or UpdateExpression |
| One table per entity | Cannot fetch related data efficiently | Single-table design with shared PK |
| Missing projections | Wastes RCU on unneeded attributes | Always set ProjectionExpression |
| Large items (>100KB) | Slow reads, high RCU cost | Compress or move large data to S3 |
| Relational modeling | Normalized tables need multiple queries | Denormalize, duplicate data at write time |
| Ignoring GSI cost | Each GSI replicates all writes | Only create GSIs for real access patterns |
| No retry on unprocessed | Silent data loss in batch ops | Always handle UnprocessedItems/Keys |
| Filter instead of key design | Reads then discards data | Push filtering into key/index design |
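As an example of replacing read-before-write: a conditional Put claims a unique value atomically, with no prior Get and no race window (the key shape below is illustrative):

```python
# PutItem request that succeeds only if the username is unclaimed;
# a concurrent claim fails with ConditionalCheckFailedException.
def claim_username_request(name: str, user_id: str) -> dict:
    return {
        "TableName": "app",  # assumed table name
        "Item": {
            "PK": {"S": f"USERNAME#{name.lower()}"},
            "SK": {"S": "CLAIM"},
            "userId": {"S": user_id},
        },
        "ConditionExpression": "attribute_not_exists(PK)",
    }
```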
GetItem → exact PK + SK
Query → exact PK + SK condition (=, <, >, between, begins_with)
Scan → avoid; full table read
PutItem → write single item (upsert)
UpdateItem → partial update with expressions
DeleteItem → remove by PK + SK
BatchGetItem → up to 100 GetItem calls
BatchWriteItem → up to 25 Put/Delete calls
TransactGetItems → up to 100 items, ACID reads
TransactWriteItems → up to 100 items, ACID writes
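Query and Scan return at most 1 MB per call, so any listing operation above should be paged; a generator sketch over a boto3-style client:

```python
# Follow LastEvaluatedKey until the result set is exhausted.
def paginate_query(client, request: dict):
    while True:
        response = client.query(**request)
        yield from response.get("Items", [])
        last_key = response.get("LastEvaluatedKey")
        if not last_key:
            return
        request = {**request, "ExclusiveStartKey": last_key}
```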
| PK | SK | GSI1PK | GSI1SK | Attrs |
|---|---|---|---|---|
| TENANT#t1 | METADATA | — | — | name, plan, createdAt |
| TENANT#t1 | USER#u1 | USER#u1 | TENANT#t1 | email, role |
| TENANT#t1 | PROJECT#p1 | STATUS#active | DATE#2024-03-15 | title, owner |
| USER#u1 | METADATA | EMAIL#[email protected] | USER | name, avatar |
| USER#u1 | SESSION#s1 | — | — | token, ttl |
| PROJECT#p1 | METADATA | — | — | title, description |
| PROJECT#p1 | TASK#2024-03-15#tk1 | ASSIGNEE#u1 | DUE#2024-03-20 | title, status |
Access patterns served:
- Query PK=TENANT#t1, SK=METADATA
- Query PK=TENANT#t1, SK begins_with USER#
- Query GSI1 PK=EMAIL#[email protected]
- Query GSI1 PK=ASSIGNEE#u1, SK begins_with DUE#
- Query GSI1 PK=STATUS#active, SK begins_with DATE#

In-depth guides in the references/ directory:
references/advanced-patterns.md — Single-table design deep dive, adjacency list pattern, composite sort keys, sparse indexes, write sharding, hot partition mitigation, hierarchical data modeling, time-series patterns, graph-like queries, materialized aggregations, multi-tenant isolation, event sourcing.
references/troubleshooting.md — Throttling diagnosis, hot partition identification, GSI backpressure, scan performance optimization, large item issues, transaction conflicts, Stream processing lag, capacity estimation errors, cost optimization, common error codes, monitoring and alerting.
references/api-reference.md — Complete DynamoDB API patterns with code examples: GetItem, PutItem, UpdateItem, DeleteItem, Query, Scan, BatchGetItem, BatchWriteItem, TransactGetItems, TransactWriteItems, PartiQL, expression syntax (key conditions, filters, projections, conditions, update expressions), pagination patterns, error handling.
Executable helper scripts in the scripts/ directory:
scripts/table-design.sh — Interactive CLI to scaffold a DynamoDB table definition. Prompts for table name, keys, billing mode, GSIs, TTL, Streams, and PITR. Outputs CloudFormation YAML, CDK TypeScript, or Terraform HCL. Supports --non-interactive mode for automation.
```shell
./scripts/table-design.sh                     # interactive
./scripts/table-design.sh --output cdk        # CDK output
./scripts/table-design.sh --output terraform  # Terraform output
```
scripts/capacity-calculator.sh — Calculate RCU/WCU requirements and estimated monthly cost based on item size, read/write rates, consistency mode, and GSI count. Compares provisioned vs on-demand pricing.
```shell
./scripts/capacity-calculator.sh --item-size 2.5 --reads 500 --writes 200 --consistency eventual
./scripts/capacity-calculator.sh --item-size 4 --reads 1000 --writes 100 --gsi-count 3
```
scripts/scan-table.sh — Parallel scan a DynamoDB table with progress tracking. Supports filtering, projection, rate limiting, and JSON output. Requires AWS CLI and jq.
```shell
./scripts/scan-table.sh --table MyTable --segments 10 --output results.json
./scripts/scan-table.sh --table MyTable --filter "status = :s" --values '{":s":{"S":"active"}}'
```
Reusable templates in the assets/ directory:
assets/cloudformation-table.yaml — Production-ready CloudFormation template for a DynamoDB table with: two GSIs (GSI1, GSI2), auto-scaling on all indexes, Point-in-Time Recovery, DynamoDB Streams, Contributor Insights, TTL, CloudWatch alarms for throttling and system errors. Parameterized for billing mode, capacity ranges, and environment.
assets/single-table-schema.json — Complete single-table design schema document for a SaaS project management app. Documents 8 entity types (Tenant, User, TenantMembership, Project, Task, Comment, Notification, AuditLog), their key patterns, GSI mappings (including sparse indexes), 14 access patterns with query specifications, TTL strategy, and capacity estimates.