Guide for designing and managing a butler's PostgreSQL database schema. Use when creating tables, writing migrations, adding indexes, or evolving a butler's data model.
Use this skill when creating or modifying a butler's database schema — adding tables, writing Alembic migrations, designing indexes, or evolving the data model for a specific butler's needs.
All butlers share a single PostgreSQL database named "butlers". Each butler gets its own schema (general, health, messenger, etc.) plus read access to the public schema. Inter-butler data exchange happens only via MCP tools through the Switchboard. The core tables are defined once in core_001_target_state_baseline.py and replicated into each butler's schema via search_path. Migrations use raw SQL strings passed to op.execute(), not SQLAlchemy ORM operations. There are no SQLAlchemy models (target_metadata=None).
PostgreSQL database: "butlers"
├── public        # Extensions (pgcrypto, vector, uuid-ossp) + cross-butler tables (identity, model catalog, etc.)
├── general       # General butler's domain tables
├── health        # Health butler's domain tables
├── messenger     # Messenger butler's domain tables
├── relationship  # Relationship butler's domain tables
└── switchboard   # Switchboard butler's domain tables
Each butler's runtime connection sets:
SET search_path TO <own_schema>, public
This means a butler's tools can query state, sessions, etc. without schema-qualifying — those tables exist in the butler's own schema. Tables in public (like calendar_sources) are also visible without qualification.
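For example, with the health butler's connection, unqualified names resolve against the butler's own schema first (a sketch; the key value is illustrative):

-- Resolves to health.state because of the search_path
SELECT value
FROM state
WHERE key = 'module:email:last_check';

-- Resolves to public.calendar_sources (no table of that name in the health schema)
SELECT id, lane
FROM calendar_sources;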
Each butler gets a runtime role (butler_<name>_rw) with full privileges on its own schema and read-only access to public.
These roles and privileges are managed by core_001_target_state_baseline.py. New butlers must be added to the _BUTLER_SCHEMAS tuple in that migration (or a subsequent one).
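A minimal sketch of what those grants could look like for one butler; the actual statements live in core_001_target_state_baseline.py and may differ:

-- Illustrative only: runtime role with full access to its own schema,
-- read-only access to the shared public schema
CREATE ROLE butler_health_rw LOGIN;
GRANT USAGE, CREATE ON SCHEMA health TO butler_health_rw;
GRANT ALL PRIVILEGES ON ALL TABLES IN SCHEMA health TO butler_health_rw;
ALTER DEFAULT PRIVILEGES IN SCHEMA health GRANT ALL ON TABLES TO butler_health_rw;
GRANT USAGE ON SCHEMA public TO butler_health_rw;
GRANT SELECT ON ALL TABLES IN SCHEMA public TO butler_health_rw;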
Every butler schema contains five core tables created by the core_001 migration. They are created once in the migration but land in whichever schema search_path is pointing to at migration time.
| Table | Purpose | Primary access pattern |
|---|---|---|
state | Key-value JSONB store | Point lookups by key, prefix scans |
sessions | Runtime invocation history & trace metadata | Recent-first, lookup by request_id |
scheduled_tasks | Cron-driven recurring prompts + job dispatch | Query enabled + due tasks |
route_inbox | Accept-then-process inbox for route requests | Filter by lifecycle_state |
butler_secrets | Encrypted secrets store (tokens, API keys) | Lookup by secret_key, filter by category |
state — Key-Value Store

General-purpose persistent storage for structured data. Used by core components and modules to store configuration state, counters, flags, cached results, module-specific KV data.
CREATE TABLE state (
key TEXT PRIMARY KEY,
value JSONB NOT NULL DEFAULT '{}'::jsonb,
updated_at TIMESTAMPTZ NOT NULL DEFAULT now(),
version INTEGER NOT NULL DEFAULT 1
);
-- Prefix scans for namespaced keys (e.g., "module:email:%")
CREATE INDEX idx_state_key_prefix ON state (key text_pattern_ops);
Keys should be namespaced with colons: module:email:last_check, scheduler:last_tick, config:override:timezone. The version column tracks mutation count for optimistic concurrency.
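A sketch of how the version column can gate concurrent writers (key and value are illustrative):

-- Read the current row and remember its version (say 3), then update conditionally.
-- If another writer bumped the version in the meantime, this updates 0 rows
-- and the caller should re-read and retry.
UPDATE state
SET value      = '{"last_check": "2026-02-23T08:00:00Z"}'::jsonb,
    version    = version + 1,
    updated_at = now()
WHERE key = 'module:email:last_check'
  AND version = 3;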
sessions — Runtime Invocation History

Every LLM CLI invocation spawned by this butler is recorded here. Includes trace metadata, token usage, and cost tracking.
CREATE TABLE sessions (
id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
prompt TEXT NOT NULL,
trigger_source TEXT NOT NULL, -- 'schedule:<task-name>', 'tick', 'external', 'trigger'
model TEXT,
success BOOLEAN,
error TEXT,
result TEXT,
tool_calls JSONB NOT NULL DEFAULT '[]'::jsonb,
duration_ms INTEGER,
trace_id TEXT,
request_id TEXT,
cost JSONB,
input_tokens INTEGER,
output_tokens INTEGER,
parent_session_id UUID,
started_at TIMESTAMPTZ NOT NULL DEFAULT now(),
completed_at TIMESTAMPTZ
);
CREATE INDEX idx_sessions_request_id ON sessions (request_id);
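Typical read patterns, matching the table's access profile (illustrative queries):

-- Most recent invocations first
SELECT id, trigger_source, model, success, duration_ms, started_at
FROM sessions
ORDER BY started_at DESC
LIMIT 20;

-- Look up the session behind a specific request
SELECT id, prompt, result, error
FROM sessions
WHERE request_id = 'req-2026-02-23-0001';

Per the indexing guidance below, a descending index on started_at would serve the recency query if one is not already in place.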
scheduled_tasks — Cron-Driven Scheduler

Stores both TOML-defined (bootstrap) and runtime-created scheduled tasks. Supports two dispatch modes: prompt (spawns an LLM session) and job (calls a Python function directly).
CREATE TABLE scheduled_tasks (
id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
name TEXT NOT NULL UNIQUE,
cron TEXT NOT NULL,
prompt TEXT, -- Required for prompt mode, NULL for job mode
dispatch_mode TEXT NOT NULL DEFAULT 'prompt',
job_name TEXT, -- Required for job mode
job_args JSONB,
timezone TEXT NOT NULL DEFAULT 'UTC',
start_at TIMESTAMPTZ, -- Window start (optional)
end_at TIMESTAMPTZ, -- Window end (optional)
until_at TIMESTAMPTZ, -- Expiry date (optional)
display_title TEXT,
calendar_event_id UUID, -- FK to calendar_events for linked events
source TEXT NOT NULL DEFAULT 'db', -- 'toml' or 'db'
enabled BOOLEAN NOT NULL DEFAULT true,
next_run_at TIMESTAMPTZ,
last_run_at TIMESTAMPTZ,
last_result JSONB,
created_at TIMESTAMPTZ NOT NULL DEFAULT now(),
updated_at TIMESTAMPTZ NOT NULL DEFAULT now(),
CONSTRAINT scheduled_tasks_dispatch_mode_check
CHECK (dispatch_mode IN ('prompt', 'job')),
CONSTRAINT scheduled_tasks_dispatch_payload_check
CHECK (
(dispatch_mode = 'prompt' AND prompt IS NOT NULL AND job_name IS NULL)
OR (dispatch_mode = 'job' AND job_name IS NOT NULL)
),
CONSTRAINT scheduled_tasks_window_bounds_check
CHECK (start_at IS NULL OR end_at IS NULL OR end_at > start_at),
CONSTRAINT scheduled_tasks_until_bounds_check
CHECK (until_at IS NULL OR start_at IS NULL OR until_at >= start_at)
);
CREATE UNIQUE INDEX ix_scheduled_tasks_calendar_event_id
ON scheduled_tasks (calendar_event_id)
WHERE calendar_event_id IS NOT NULL;
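The scheduler's due-task scan might look like this (a sketch; the actual dispatcher query lives in the scheduler code):

-- Enabled tasks whose next run time has arrived, oldest first
SELECT id, name, dispatch_mode, prompt, job_name, job_args, timezone
FROM scheduled_tasks
WHERE enabled = true
  AND next_run_at IS NOT NULL
  AND next_run_at <= now()
ORDER BY next_run_at
LIMIT 10;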
route_inbox — Accept-Then-Process Inbox

Incoming route requests are accepted immediately (returning an ID) then processed asynchronously.
CREATE TABLE route_inbox (
id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
received_at TIMESTAMPTZ NOT NULL DEFAULT now(),
route_envelope JSONB NOT NULL,
lifecycle_state TEXT NOT NULL DEFAULT 'accepted',
processed_at TIMESTAMPTZ,
session_id UUID,
error TEXT
);
CREATE INDEX idx_route_inbox_lifecycle_state
ON route_inbox (lifecycle_state, received_at);
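A sketch of the accept-then-process flow; the 'processing' lifecycle value and the envelope shape are assumptions:

-- Accept: persist the envelope and hand back an id immediately
INSERT INTO route_inbox (route_envelope)
VALUES ('{"source": "switchboard", "prompt": "Summarize today"}'::jsonb)
RETURNING id;

-- Process: claim the oldest accepted request without blocking other workers
UPDATE route_inbox
SET lifecycle_state = 'processing'
WHERE id = (
    SELECT id
    FROM route_inbox
    WHERE lifecycle_state = 'accepted'
    ORDER BY received_at
    LIMIT 1
    FOR UPDATE SKIP LOCKED
)
RETURNING id, route_envelope;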
butler_secrets — Secrets Store

Generic secrets store for tokens, API keys, and sensitive configuration. Secrets are stored per-butler in the butler's own schema.
CREATE TABLE butler_secrets (
secret_key TEXT PRIMARY KEY,
secret_value TEXT NOT NULL,
category TEXT NOT NULL DEFAULT 'general',
description TEXT,
is_sensitive BOOLEAN NOT NULL DEFAULT true,
created_at TIMESTAMPTZ NOT NULL DEFAULT now(),
updated_at TIMESTAMPTZ NOT NULL DEFAULT now(),
expires_at TIMESTAMPTZ
);
CREATE INDEX ix_butler_secrets_category ON butler_secrets (category);
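A typical lookup (the category value is illustrative):

-- Secrets in one category that have not expired
SELECT secret_key, secret_value, is_sensitive
FROM butler_secrets
WHERE category = 'oauth'
  AND (expires_at IS NULL OR expires_at > now());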
Cross-Butler Tables (public)

Tables in the public schema are readable by all butlers but writable only by core migrations. These are created by core_005 and later core migrations.
The calendar module projects Google Calendar data into these shared tables:
| Table | Purpose |
|---|---|
calendar_sources | Calendar provider sources with lane (user/butler) |
calendar_events | Base events with recurrence rules |
calendar_event_instances | Expanded recurring event occurrences |
calendar_sync_cursors | Incremental sync state per source |
calendar_action_log | Idempotent mutation audit trail |
Key design patterns in calendar tables:
- USING GIST (tstzrange(starts_at, ends_at, '[)')) for efficient overlap queries
- idempotency_key TEXT NOT NULL UNIQUE on the action log
- lane IN ('user', 'butler') separates read-only user calendars from writable butler calendars

Modules create tables in the butler's own schema via module-specific migration chains.
Memory Module (mem_001)

The memory module uses pgvector for semantic search. Four tables:
| Table | Purpose |
|---|---|
episodes | Session memory snapshots with embeddings |
facts | Persistent structured knowledge (subject/predicate/content) |
rules | Learned behavioral rules with effectiveness tracking |
memory_links | Cross-type relationships between memory entities |
Key design patterns:
- vector(384) embeddings with IVFFLAT indexes for cosine similarity search
- tsvector columns with GIN indexes for full-text search
- decay_rate, reference_count, last_referenced_at for memory aging
- Permanence tiers: permanent, stable, standard, volatile
- consolidation_status ('pending', 'done') tracks episode processing

-- Example: facts table (key columns)
CREATE TABLE facts (
id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
subject TEXT NOT NULL,
predicate TEXT NOT NULL,
content TEXT NOT NULL,
embedding vector(384),
search_vector tsvector,
importance FLOAT NOT NULL DEFAULT 5.0,
confidence FLOAT NOT NULL DEFAULT 1.0,
decay_rate FLOAT NOT NULL DEFAULT 0.008,
permanence TEXT NOT NULL DEFAULT 'standard',
validity TEXT NOT NULL DEFAULT 'active',
scope TEXT NOT NULL DEFAULT 'global',
reference_count INTEGER NOT NULL DEFAULT 0,
tags JSONB DEFAULT '[]'::jsonb,
metadata JSONB DEFAULT '{}'::jsonb,
created_at TIMESTAMPTZ NOT NULL DEFAULT now(),
last_referenced_at TIMESTAMPTZ
);
CREATE INDEX idx_facts_subject_predicate ON facts (subject, predicate);
CREATE INDEX idx_facts_scope_validity ON facts (scope, validity) WHERE validity = 'active';
CREATE INDEX idx_facts_search ON facts USING gin(search_vector);
CREATE INDEX idx_facts_tags ON facts USING gin(tags);
CREATE INDEX idx_facts_embedding ON facts USING ivfflat (embedding vector_cosine_ops) WITH (lists = 20);
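A sketch of a semantic lookup against facts; :query_embedding stands in for a 384-dimension vector supplied by the caller, and the filter values are illustrative:

-- Nearest active facts by cosine distance (pgvector's <=> operator),
-- served by the ivfflat index above
SELECT id, subject, predicate, content
FROM facts
WHERE validity = 'active'
  AND scope = 'global'
ORDER BY embedding <=> :query_embedding
LIMIT 5;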
Two tables for tool-call approval gating:
| Table | Purpose |
|---|---|
approval_rules | Pre-approval rules (tool + arg constraints) |
pending_actions | Actions awaiting approval/execution |
Three tables for external contact sync:
| Table | Purpose |
|---|---|
contacts_source_accounts | Registered sync provider accounts |
contacts_sync_state | Per-account incremental sync cursor |
contacts_source_links | External-to-local contact provenance |
Each butler defines its own domain tables via migrations in roster/<name>/migrations/. Examples:
Health butler — measurements, medications, medication_doses, conditions, meals, symptoms, research
Relationship butler — contacts, relationships, important_dates, notes, interactions, reminders, gifts, loans, groups, group_members, labels, contact_labels, quick_facts, activity_feed
General butler — collections, entities (freeform JSONB data)
Messenger butler — delivery_requests, delivery_attempts, delivery_receipts, delivery_dead_letter
Switchboard butler — butler_registry, routing_log, extraction_queue, extraction_log, message_inbox, and more
- created_at. Every table gets created_at TIMESTAMPTZ NOT NULL DEFAULT now().
- updated_at on mutable tables. If rows get updated, track when.
- UUID primary keys for domain tables. Use BIGINT GENERATED ALWAYS AS IDENTITY only for high-volume append-only tables.
- TEXT over VARCHAR. PostgreSQL treats them identically. TEXT is simpler.
- JSONB arrays (DEFAULT '[]'::jsonb) over TEXT[] — this is the established pattern across the codebase.
- ON DELETE CASCADE for child records that have no meaning without their parent.
- Inline CHECK constraints, e.g. CHECK (status IN ('pending', 'active', 'done')), instead of a separate lookup table.

Butler query patterns are heavily biased toward recent data. Design indexes accordingly.
Every timestamp column used in WHERE or ORDER BY gets a descending index. Butlers almost always want "most recent first."
CREATE INDEX idx_<table>_<col> ON <table> (<col> DESC);
Compound indexes for filtered recency queries. If you filter by a category and sort by time:
CREATE INDEX idx_<table>_<filter>_recent ON <table> (<filter_col>, <time_col> DESC);
GIN indexes for JSONB columns you search inside. Use jsonb_path_ops for containment queries (@>), plain GIN if you also need key-existence checks (?, ?|):
CREATE INDEX idx_<table>_<col>_gin ON <table> USING GIN (<col> jsonb_path_ops);
-- or plain GIN (established pattern in codebase):
CREATE INDEX idx_<table>_<col>_gin ON <table> USING GIN (<col>);
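For example, a containment query that a GIN index on sessions.tool_calls could serve (a sketch; the stored tool-call shape is an assumption):

-- Find sessions that called a specific tool
SELECT id, started_at
FROM sessions
WHERE tool_calls @> '[{"name": "schedule_add"}]'::jsonb
ORDER BY started_at DESC;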
GIN indexes for JSONB array columns:
CREATE INDEX idx_<table>_tags_gin ON <table> USING GIN (tags);
Partial indexes for hot subsets. If you frequently query only active items or pending tasks:
CREATE INDEX idx_<table>_active ON <table> (<col>) WHERE status = 'active';
CREATE INDEX idx_tasks_due ON scheduled_tasks (next_run_at) WHERE enabled = true;
GiST indexes for time-range overlap queries (used by calendar):
CREATE INDEX idx_<table>_time_window_gist
ON <table> USING GIST (tstzrange(starts_at, ends_at, '[)'));
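An overlap query this index serves (a sketch; the column names follow the pattern above and the target table is assumed to be calendar_event_instances):

-- Events overlapping the next 24 hours, using the same '[)' bounds as the index expression
SELECT *
FROM calendar_event_instances
WHERE tstzrange(starts_at, ends_at, '[)')
      && tstzrange(now(), now() + interval '24 hours', '[)');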
IVFFLAT indexes for vector embeddings (used by memory):
CREATE INDEX idx_<table>_embedding
ON <table> USING ivfflat (embedding vector_cosine_ops) WITH (lists = 20);
Don't index columns you never filter or sort on. For example, a JSONB detail column needs no GIN index unless you actually run containment queries against it.
The codebase uses both idx_ and ix_ prefixes (both are acceptable). Be consistent within a single migration file. Pattern: idx_<table>_<column(s)>.
All schema changes go through Alembic. No exceptions.
Migrations are organized into independent chains that are auto-discovered by alembic/env.py:
alembic/
alembic.ini
env.py # Multi-chain discovery + schema-scoped runner
versions/
core/ # Shared core chain (branch_labels=("core",))
core_001_target_state_baseline.py
core_002_add_dispatch_mode_columns.py
core_005_add_calendar_projection_tables.py
...
src/butlers/modules/
memory/migrations/ # Memory module chain (branch_labels=("memory",))
001_memory_baseline.py
approvals/migrations/ # Approvals module chain
001_create_approvals_tables.py
002_create_approval_events.py
contacts/migrations/ # Contacts module chain
001_contacts_sync_tables.py
mailbox/migrations/ # Mailbox module chain
001_create_mailbox_table.py
roster/
health/migrations/ # Health butler chain (branch_labels=("health",))
001_health_tables.py
general/migrations/ # General butler chain (branch_labels=("general",))
001_general_tables.py
002_add_entity_tags.py
relationship/migrations/ # Relationship butler chain
001_relationship_tables.py
rel_002a_enrich_interactions.py
...
messenger/migrations/ # Messenger butler chain
msg_001_create_delivery_tables.py
switchboard/migrations/ # Switchboard butler chain
001_switchboard_tables.py
002_extraction_tables.py
...
The discovery mechanism in alembic/env.py scans three locations for migration chains:
- alembic/versions/core/
- src/butlers/modules/*/migrations/
- roster/*/migrations/

Every migration file follows this template:
"""<Short description of what this migration does>.
Revision ID: <prefix>_<number>
Revises:
Create Date: YYYY-MM-DD HH:MM:SS.000000
"""
from __future__ import annotations
from alembic import op
# revision identifiers, used by Alembic.
revision = "<prefix>_001" # e.g., "health_001", "mem_001", "core_005"
down_revision = None # None for first migration in chain, else previous revision
branch_labels = ("<chain>",) # Only on first migration in a chain (e.g., ("health",))
depends_on = None # Cross-chain dependency (e.g., "core_001")
def upgrade() -> None:
# Raw SQL via op.execute() — NO SQLAlchemy ORM operations
op.execute("""
CREATE TABLE IF NOT EXISTS example (
id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
name TEXT NOT NULL,
data JSONB NOT NULL DEFAULT '{}'::jsonb,
created_at TIMESTAMPTZ NOT NULL DEFAULT now(),
updated_at TIMESTAMPTZ NOT NULL DEFAULT now()
)
""")
op.execute("""
CREATE INDEX IF NOT EXISTS idx_example_name
ON example (name)
""")
def downgrade() -> None:
op.execute("DROP INDEX IF EXISTS idx_example_name")
op.execute("DROP TABLE IF EXISTS example")
Raw SQL only. Use op.execute("CREATE TABLE ..."), not op.create_table(...) with SQLAlchemy Column objects. The project has target_metadata=None.
IF NOT EXISTS / IF EXISTS. All DDL uses idempotent forms because multiple schema-scoped runs execute the same migration file.
Branch labels on first migration only. The first migration in a chain sets branch_labels = ("<chain_name>",). Subsequent migrations in the chain set branch_labels = None.
Revision ID prefixes. Use a chain prefix for readability:
- Core chain: core_001, core_002, ...
- Module chains: mem_001, approvals_001, contacts_001, ...
- Butler chains: health_001, gen_001, rel_001, msg_001, sw_001, ...

One logical change per migration. Don't combine "add contacts table" and "add index on log" in the same migration.
Always write downgrade(). Even if you think you'll never roll back.
No SQLAlchemy imports beyond from alembic import op. Don't import sa, sqlalchemy, or postgresql dialect modules.
To add a migration chain for a new butler:
1. Create roster/<butler-name>/migrations/__init__.py (empty file)
2. Add the first migration at roster/<butler-name>/migrations/001_<butler>_tables.py
3. Set branch_labels = ("<butler-name>",) on that first migration
4. Use the revision ID <butler-name>_001
5. No changes to alembic/env.py are needed; the chain is auto-discovered

Example first migration for a new butler:
"""create_finance_tables
Revision ID: finance_001
Revises:
Create Date: 2026-02-23 00:00:00.000000
"""
from __future__ import annotations
from alembic import op
revision = "finance_001"
down_revision = None
branch_labels = ("finance",)
depends_on = None
def upgrade() -> None:
op.execute("""
CREATE TABLE IF NOT EXISTS accounts (
id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
name TEXT NOT NULL UNIQUE,
account_type TEXT NOT NULL,
currency TEXT NOT NULL DEFAULT 'USD',
metadata JSONB NOT NULL DEFAULT '{}'::jsonb,
created_at TIMESTAMPTZ NOT NULL DEFAULT now(),
updated_at TIMESTAMPTZ NOT NULL DEFAULT now()
)
""")
op.execute("""
CREATE TABLE IF NOT EXISTS transactions (
id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
account_id UUID NOT NULL REFERENCES accounts(id) ON DELETE CASCADE,
amount NUMERIC(12,2) NOT NULL,
description TEXT,
category TEXT,
metadata JSONB NOT NULL DEFAULT '{}'::jsonb,
occurred_at TIMESTAMPTZ NOT NULL,
created_at TIMESTAMPTZ NOT NULL DEFAULT now()
)
""")
op.execute("""
CREATE INDEX IF NOT EXISTS idx_transactions_account_occurred
ON transactions (account_id, occurred_at DESC)
""")
op.execute("""
CREATE INDEX IF NOT EXISTS idx_transactions_category_occurred
ON transactions (category, occurred_at DESC)
""")
def downgrade() -> None:
op.execute("DROP TABLE IF EXISTS transactions")
op.execute("DROP TABLE IF EXISTS accounts")
Every migration must be backward-compatible. Assume the old code is still running when the migration executes.
| Operation | Safe? | How to do it safely |
|---|---|---|
| Add a table | Yes | CREATE TABLE IF NOT EXISTS. Old code ignores it. |
| Add a nullable column | Yes | ALTER TABLE ADD COLUMN ... DEFAULT NULL. Old code ignores it. |
| Add a column with a default | Yes | ALTER TABLE ADD COLUMN ... DEFAULT <value>. Old code ignores it. |
| Add an index | Yes | Use CREATE INDEX CONCURRENTLY for large tables. See note below. |
| Drop a column | Two-phase. | Phase 1: Stop reading/writing the column in code. Deploy. Phase 2: Drop column. |
| Rename a column | Two-phase. | Phase 1: Add new column, backfill, update code. Phase 2: Drop old column. |
| Drop a table | Two-phase. | Phase 1: Remove all code references. Deploy. Phase 2: Drop table. |
| Change a column type | Careful. | Add new column, backfill, migrate code, drop old. |
| Add NOT NULL | Two-phase. | Phase 1: Backfill NULLs, set default in code. Phase 2: SET NOT NULL. |
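As a sketch of the two-phase NOT NULL pattern, reusing the hypothetical finance tables from the example above:

-- Phase 1 (ship alongside code that always writes the column): backfill and default
UPDATE transactions SET category = 'uncategorized' WHERE category IS NULL;
ALTER TABLE transactions ALTER COLUMN category SET DEFAULT 'uncategorized';

-- Phase 2 (a later migration, after Phase 1 is deployed everywhere): enforce the constraint
ALTER TABLE transactions ALTER COLUMN category SET NOT NULL;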
CONCURRENTLY note: CREATE INDEX CONCURRENTLY cannot run inside a transaction. If needed, the migration must disable the transaction wrapper:
def upgrade() -> None:
op.execute("COMMIT") # Exit Alembic's transaction
op.execute("CREATE INDEX CONCURRENTLY IF NOT EXISTS idx_name ON table (col)")
When the daemon runs migrations for a butler, alembic/env.py sets the target schema:
# In env.py run_migrations_online():
if target_schema is not None:
connection.exec_driver_sql(f"CREATE SCHEMA IF NOT EXISTS {own_schema}")
connection.exec_driver_sql(f"SET search_path TO {own_schema}, public")
This means:
- Core tables (state, sessions, etc.) are created in the butler's own schema, not in public
- The public schema contains only tables explicitly created there by core migrations (calendar projections, etc.)

When adding a new butler, core_001 (or a subsequent core migration) must list the butler in _BUTLER_SCHEMAS so its runtime role is created and privileges are granted. If the butler is added after the initial deployment, write a new core migration that creates the schema and role and applies the same grants.
Quick rules recap:
- Raw SQL only: all DDL goes through op.execute()
- One butlers DB with schema isolation
- Always IF NOT EXISTS / IF EXISTS — migrations run per-schema, idempotency is required
- No datetime.now() in SQL — use now() for consistency
- No migrations in src/butlers/db/ — they go in alembic/versions/core/, src/butlers/modules/*/migrations/, or roster/*/migrations/
- No sqlalchemy imports in migrations — only import from alembic import op