Name: Consolidation Run
Author: racmac57

Execute consolidate_cad_2019_2026.py --full and verify the output.

Arguments

$ARGUMENTS -- Pass --dry-run to only show pre-flight checks without running.

Pre-Flight Checks

Before running, verify these conditions:

1. Config is current

Read config/consolidation_sources.yaml and check:

# Show configured sources and their expected counts
python -c "
import yaml
with open('config/consolidation_sources.yaml') as f:
    cfg = yaml.safe_load(f)
print('=== Yearly Sources ===')
for s in cfg['sources']['yearly']:
    print(f\"  {s['year']}: expected={s.get('expected_records', '?')} - {s['path'].split(chr(92))[-1]}\")
print()
print('=== Monthly Sources ===')
for s in cfg['sources']['monthly']:
    print(f\"  {s['month']}: {s['path'].split(chr(92))[-1]}\")
print()
print(f\"Baseline: {cfg['baseline']['path'].split(chr(92))[-1]}\")
print(f\"Baseline records: {cfg['baseline']['record_count']}\")
print(f\"Date range: {cfg['baseline']['date_range']['start']} to {cfg['baseline']['date_range']['end']}\")
"

Execute consolidate_cad_2019_2026.py --full and verify the output.

Arguments

$ARGUMENTS -- Pass --dry-run to only show pre-flight checks without running.

Pre-Flight Checks

Before running, verify these conditions:

1. Config is current

Read config/consolidation_sources.yaml and check:

# Show configured sources and their expected counts
python -c "
import yaml
with open('config/consolidation_sources.yaml') as f:
    cfg = yaml.safe_load(f)
print('=== Yearly Sources ===')
for s in cfg['sources']['yearly']:
    print(f\"  {s['year']}: expected={s.get('expected_records', '?')} - {s['path'].split(chr(92))[-1]}\")
print()
print('=== Monthly Sources ===')
for s in cfg['sources']['monthly']:
    print(f\"  {s['month']}: {s['path'].split(chr(92))[-1]}\")
print()
print(f\"Baseline: {cfg['baseline']['path'].split(chr(92))[-1]}\")
print(f\"Baseline records: {cfg['baseline']['record_count']}\")
print(f\"Date range: {cfg['baseline']['date_range']['start']} to {cfg['baseline']['date_range']['end']}\")
"

Metric	Threshold	Source
Total records	700,000 - 800,000	`validation.expected_total_records_min/max`
Quality score	>= 95	`validation.min_quality_score`
Duplicate rate	<= 1%	`validation.max_duplicate_rate`

Consolidation Run

Arguments

Pre-Flight Checks

1. Config is current

Consolidation Run

Arguments

Pre-Flight Checks

1. Config is current

2. Mode is full (not incremental)

3. Output directory exists

4. Dependencies available

If `--dry-run`: stop here and report pre-flight results.

Execution

Post-Run Verification

1. Read the run report

2. Validate against thresholds

3. Check for common issues

Report to User

Performance Notes

Clickhouse Io

Clickhouse Io

Claude Devfleet

Clickhouse Io

Ai First Engineering

Postgres Patterns

Consolidation Run

Arguments

Pre-Flight Checks

1. Config is current

Consolidation Run

Arguments

Pre-Flight Checks

1. Config is current

2. Mode is full (not incremental)

3. Output directory exists

4. Dependencies available

If --dry-run: stop here and report pre-flight results.

Execution

Post-Run Verification

1. Read the run report

2. Validate against thresholds

3. Check for common issues

Report to User

Performance Notes

Clickhouse Io

Clickhouse Io

Claude Devfleet

Clickhouse Io

Ai First Engineering

Postgres Patterns

If `--dry-run`: stop here and report pre-flight results.