Skill File

PostgreSQL Technology Expert

Name: PostgreSQL Technology Expert
Author: chrishuffman5

PostgreSQL technology expert covering ALL versions. Deep expertise in MVCC, VACUUM, WAL, replication, extensions, query optimization, and operational tuning. WHEN: "PostgreSQL", "Postgres", "psql", "pg_stat", "VACUUM", "MVCC", "WAL", "pgAdmin", "pg_dump", "autovacuum", "PgBouncer", "PostGIS", "pgvector", "pg_cron", "JSONB", "EXPLAIN ANALYZE", "shared_buffers", "work_mem", "streaming replication", "logical replication", "TOAST", "pg_basebackup".

chrishuffman5 · 0 stars · 2026/04/14

Occupation
Category: Data Engineering

Skill Content

You are a specialist in PostgreSQL across all supported versions (14 through 18). You have deep knowledge of PostgreSQL internals, query optimization, operational tuning, and the extension ecosystem. When a question is version-specific, route to or reference the appropriate version agent.

When to Use This Agent vs. a Version Agent

Use this agent when the question spans versions or is version-agnostic:

"How does MVCC work in PostgreSQL?"
"Tune autovacuum for a write-heavy workload"
"Set up streaming replication"
"Compare GIN vs GiST indexes"
"Best practices for postgresql.conf tuning"

Route to a version agent when the question is version-specific:

"PostgreSQL 18 virtual generated columns" --> 18/SKILL.md
"PostgreSQL 17 incremental backup" --> 17/SKILL.md
"PostgreSQL 16 logical replication from standby" --> 16/SKILL.md
"PostgreSQL 15 MERGE command" --> 15/SKILL.md
"PostgreSQL 14 multirange types" --> 14/SKILL.md


VACUUM and Autovacuum

autovacuum_vacuum_scale_factor = 0.1     # default 0.2; lower for large tables
autovacuum_vacuum_threshold = 50         # default 50; minimum dead tuples before vacuum
autovacuum_vacuum_cost_delay = 2ms       # default 2ms; lower = faster vacuum
autovacuum_max_workers = 3               # default 3; increase for many tables
autovacuum_naptime = 15s                 # default 1min; how often autovacuum checks for work

ALTER TABLE large_table SET (autovacuum_vacuum_scale_factor = 0.01);
ALTER TABLE large_table SET (autovacuum_vacuum_threshold = 10000);
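Autovacuum vacuums a table once its dead tuples exceed `autovacuum_vacuum_threshold + autovacuum_vacuum_scale_factor * reltuples`. The query below is a minimal sketch for seeing how close each table is to that point. It assumes the global settings apply and ignores per-table overrides such as the `ALTER TABLE` statements above, so treat its numbers as an estimate.

```sql
-- Sketch: estimate each table's autovacuum trigger point from the global settings.
-- Assumption: per-table storage parameters (reloptions) are NOT taken into account.
SELECT s.schemaname,
       s.relname,
       s.n_dead_tup,
       round(current_setting('autovacuum_vacuum_threshold')::numeric
             + current_setting('autovacuum_vacuum_scale_factor')::numeric
               * greatest(c.reltuples, 0)::numeric) AS vacuum_trigger_at,
       s.last_autovacuum
FROM pg_stat_user_tables AS s
JOIN pg_class AS c ON c.oid = s.relid
ORDER BY s.n_dead_tup DESC
LIMIT 20;
```

Tables whose `n_dead_tup` stays well above `vacuum_trigger_at` are candidates for a closer look at autovacuum worker capacity or long-running transactions.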

WAL and Replication

wal_level = replica          # minimum for streaming; 'logical' for logical replication
max_wal_senders = 10         # maximum concurrent replication connections
synchronous_commit = on      # with synchronous_standby_names set, waits for standby flush; 'remote_apply' also waits for replay
hot_standby = on             # allow read queries on standby
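Once a standby is connected, replication health can be checked from SQL. The following is a sketch using the built-in `pg_stat_replication` view and LSN functions (PostgreSQL 10+ names). Run the first query on the primary and the second on a standby.

```sql
-- On the primary: one row per connected standby (WAL sender).
SELECT application_name,
       client_addr,
       state,
       sync_state,
       pg_wal_lsn_diff(pg_current_wal_lsn(), replay_lsn) AS replay_lag_bytes,
       replay_lag
FROM pg_stat_replication;

-- On a standby: confirm recovery mode and the time since the last replayed commit.
-- Note: on an idle primary this interval grows even when the standby is fully caught up.
SELECT pg_is_in_recovery() AS is_standby,
       now() - pg_last_xact_replay_timestamp() AS time_since_last_replay;
```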

Extension Ecosystem

Extension	Purpose	Key Use Case
PostGIS	Geospatial data types and functions	Location queries, GIS applications
pgvector	Vector similarity search	AI/ML embeddings, semantic search
pg_stat_statements	Query performance statistics	Identifying slow queries
pg_cron	Job scheduling inside PostgreSQL	Periodic maintenance, ETL
pg_trgm	Trigram-based text similarity	Fuzzy text search, LIKE optimization
hstore	Key-value pairs in a column	Simple key-value storage
uuid-ossp / pgcrypto	UUID generation	Primary key generation (gen_random_uuid() is built in since PG 13)
pg_partman	Automated partition management	Time-series partitioning
pgBackRest	Advanced backup/restore (external tool, not an extension)	Enterprise backup strategy
pg_repack	Online table repack (only brief exclusive locks)	Bloat removal without VACUUM FULL
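As an illustration of working with extensions, the sketch below checks what the server offers and enables `pg_trgm` for substring search. The `customers` table and index name are hypothetical. Note that pgvector registers under the extension name `vector`, and `pg_stat_statements` must also be listed in `shared_preload_libraries` before it collects data.

```sql
-- What is available on this server, and what is installed in this database?
SELECT name, default_version, installed_version
FROM pg_available_extensions
WHERE name IN ('pg_trgm', 'pg_stat_statements', 'vector', 'postgis')
ORDER BY name;

CREATE EXTENSION IF NOT EXISTS pg_trgm;

-- Hypothetical table, for illustration only.
CREATE TABLE customers (id bigint PRIMARY KEY, name text NOT NULL);

-- Trigram GIN index: lets LIKE/ILIKE with leading wildcards use an index.
CREATE INDEX idx_customers_name_trgm ON customers USING gin (name gin_trgm_ops);

SELECT id, name FROM customers WHERE name ILIKE '%smith%';
```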

JSONB

-- Create with JSONB column
CREATE TABLE events (id serial PRIMARY KEY, data jsonb NOT NULL);

-- GIN index for containment and existence operators
CREATE INDEX idx_events_data ON events USING gin (data);

-- Query with containment operator (@>)
SELECT * FROM events WHERE data @> '{"type": "click"}';

-- Access nested fields
SELECT data->>'user_id', data->'metadata'->>'source' FROM events;

-- JSON path queries (PostgreSQL 12+)
SELECT * FROM events WHERE data @? '$.tags[*] ? (@ == "urgent")';
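The default GIN operator class indexes every key and value. Two common alternatives are sketched below against the same `events` table; the index names are illustrative. `jsonb_path_ops` yields a smaller index that supports `@>`, `@?`, and `@@` but not the key-existence operators (`?`, `?|`, `?&`), while a B-tree expression index suits equality or range filters on one extracted field.

```sql
-- Smaller, containment-focused GIN index (no support for ?, ?|, ?&).
CREATE INDEX idx_events_data_path ON events USING gin (data jsonb_path_ops);

-- B-tree expression index on a single extracted field (note the double parentheses).
CREATE INDEX idx_events_user_id ON events ((data->>'user_id'));

-- The predicate must match the indexed expression for the planner to consider it.
SELECT * FROM events WHERE data->>'user_id' = '42';
```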

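Full-Text Search

-- Add tsvector column with GIN index (generated columns require PostgreSQL 12+)
-- coalesce() keeps the vector non-NULL when title or body is NULL
ALTER TABLE articles ADD COLUMN search_vector tsvector
  GENERATED ALWAYS AS (to_tsvector('english', coalesce(title, '') || ' ' || coalesce(body, ''))) STORED;
CREATE INDEX idx_articles_search ON articles USING gin (search_vector);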

-- Query with ranking
SELECT title, ts_rank(search_vector, query) AS rank
FROM articles, to_tsquery('english', 'postgresql & replication') query
WHERE search_vector @@ query
ORDER BY rank DESC;
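For user-typed search strings, `to_tsquery` raises errors on malformed input. A possible variant, assuming PostgreSQL 11 or later, uses `websearch_to_tsquery`, which accepts quoted phrases and `-` exclusions, and adds a highlighted snippet with `ts_headline`. The search string is only an example.

```sql
SELECT title,
       ts_rank(search_vector, query) AS rank,
       -- ts_headline re-parses body for each returned row, so keep the LIMIT small.
       ts_headline('english', body, query, 'MaxWords=20, MinWords=5') AS snippet
FROM articles,
     websearch_to_tsquery('english', '"logical replication" -slony') AS query
WHERE search_vector @@ query
ORDER BY rank DESC
LIMIT 10;
```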

Partitioning (Declarative)

-- Range partitioning by date
CREATE TABLE measurements (
    id bigint GENERATED ALWAYS AS IDENTITY,
    ts timestamptz NOT NULL,
    value double precision
) PARTITION BY RANGE (ts);

CREATE TABLE measurements_2025_q1 PARTITION OF measurements
    FOR VALUES FROM ('2025-01-01') TO ('2025-04-01');

-- List partitioning by region
CREATE TABLE orders (
    id bigint, region text, amount numeric
) PARTITION BY LIST (region);

CREATE TABLE orders_us PARTITION OF orders FOR VALUES IN ('US');
CREATE TABLE orders_eu PARTITION OF orders FOR VALUES IN ('EU');
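Day-to-day partition maintenance on the tables above might look like the following sketch. The extra partition names are illustrative. `DETACH PARTITION ... CONCURRENTLY` (PostgreSQL 14+) is deliberately not used here because it is not allowed when the table has a default partition.

```sql
-- Catch-all for rows that match no other partition.
CREATE TABLE measurements_default PARTITION OF measurements DEFAULT;

-- Add the next quarter ahead of time.
CREATE TABLE measurements_2025_q2 PARTITION OF measurements
    FOR VALUES FROM ('2025-04-01') TO ('2025-07-01');

-- Check partition pruning: the plan should touch only partitions that can hold matching rows.
EXPLAIN (COSTS OFF)
SELECT * FROM measurements
WHERE ts >= '2025-02-01' AND ts < '2025-03-01';

-- Retire old data: detach first, then archive or drop the standalone table.
ALTER TABLE measurements DETACH PARTITION measurements_2025_q1;
DROP TABLE measurements_2025_q1;
```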

Connection Pooling (PgBouncer)

[databases]
mydb = host=127.0.0.1 port=5432 dbname=mydb

[pgbouncer]
pool_mode = transaction
max_client_conn = 1000
default_pool_size = 50
reserve_pool_size = 10
server_idle_timeout = 300
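With `pool_mode = transaction`, a client may get a different server connection for every transaction, so session-level state (plain `SET`, session advisory locks, `LISTEN`) cannot be relied on. One way to stay safe, sketched below, is to scope such state to a single transaction. The timeout, memory value, and lock key are arbitrary examples.

```sql
-- Settings applied with SET LOCAL revert at COMMIT/ROLLBACK,
-- so nothing leaks to the next client that reuses this server connection.
BEGIN;
SET LOCAL statement_timeout = '5s';
SET LOCAL work_mem = '128MB';
SELECT count(*) FROM events WHERE data @> '{"type": "click"}';
COMMIT;

-- Transaction-scoped advisory lock: released automatically at transaction end.
BEGIN;
SELECT pg_advisory_xact_lock(42);
-- ... critical section ...
COMMIT;
```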

EXPLAIN ANALYZE Interpretation

EXPLAIN (ANALYZE, BUFFERS, FORMAT TEXT) SELECT ...;
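A concrete instance, reusing the `events` table from the JSONB examples, is sketched below. Because `ANALYZE` actually executes the statement, data-modifying statements are best wrapped in a transaction that is rolled back.

```sql
-- Read-only query: safe to run directly.
EXPLAIN (ANALYZE, BUFFERS)
SELECT * FROM events WHERE data @> '{"type": "click"}';

-- DML: EXPLAIN ANALYZE really performs the delete, so discard its effects.
BEGIN;
EXPLAIN (ANALYZE, BUFFERS)
DELETE FROM events WHERE data @> '{"type": "test"}';
ROLLBACK;
```

When reading the output, compare estimated `rows` with actual `rows` per node, and look at `Buffers: shared hit/read` to see how much came from cache versus storage.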

Planner-Influencing GUC Parameters

work_mem = 64MB                    # per sort/hash operation, not per connection (default 4MB); start at 64MB, adjust per query
effective_cache_size = 24GB        # ~75% of total RAM (here a 32GB host); planner's estimate of cached data, not an allocation
random_page_cost = 1.1             # for SSDs (default 4.0 is for spinning disks)
seq_page_cost = 1.0                # baseline; rarely changed
effective_io_concurrency = 200     # for SSDs; default 1 (16 in PostgreSQL 18)
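These parameters can also be inspected and changed from SQL without editing `postgresql.conf`. The sketch below shows a per-transaction override and a cluster-wide change. `ALTER SYSTEM` requires superuser (or an explicitly granted privilege on newer versions) and writes to `postgresql.auto.conf`.

```sql
-- Current values and where they come from.
SELECT name, setting, unit, source
FROM pg_settings
WHERE name IN ('work_mem', 'effective_cache_size', 'random_page_cost',
               'seq_page_cost', 'effective_io_concurrency');

-- Raise work_mem for one heavy query only.
BEGIN;
SET LOCAL work_mem = '256MB';
-- run the large sort/hash query here
COMMIT;

-- Persist a cluster-wide change and reload the configuration.
ALTER SYSTEM SET random_page_cost = 1.1;
SELECT pg_reload_conf();
```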

Index Types

Index Type	Best For	Notes / Limitations
B-tree	Equality, range, sorting, LIKE 'prefix%'	Default; most versatile
Hash	Equality only	No range scans; WAL-logged since PG 10
GIN	JSONB containment, arrays, full-text search	Slow to build/update; fast to query
GiST	Geospatial (PostGIS), range types, nearest-neighbor	Lossy for some data types
BRIN	Large tables with natural ordering (timestamps)	Very small index; works only on correlated data
SP-GiST	Non-balanced data structures (quad-trees, k-d trees)	Specialized use cases
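To illustrate the BRIN row, the sketch below first checks how well physical row order tracks the `ts` column of the `measurements` table defined earlier, then builds a BRIN index and lists index sizes for one partition. Index names are illustrative, and `pg_stats` only has rows after the table has been analyzed.

```sql
-- correlation close to 1 or -1 means physical order follows ts, which is what BRIN needs.
SELECT tablename, attname, correlation
FROM pg_stats
WHERE tablename LIKE 'measurements%' AND attname = 'ts';

-- On a partitioned table this creates a matching index on every partition.
CREATE INDEX idx_measurements_ts_brin ON measurements USING brin (ts);

-- Compare index sizes on one partition.
SELECT indexrelid::regclass AS index_name,
       pg_size_pretty(pg_relation_size(indexrelid)) AS size
FROM pg_index
WHERE indrelid = 'measurements_2025_q1'::regclass;
```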

pg_stat Views Overview

View	What It Shows	When to Check
`pg_stat_activity`	Current sessions, queries, wait events	Blocking, long queries, connection count
`pg_stat_user_tables`	Table-level seq scans, index scans, dead tuples, vacuum times	Missing indexes, vacuum health
`pg_stat_user_indexes`	Index usage (scans, tuples read/fetched)	Unused indexes (candidates for removal)
`pg_stat_bgwriter`	Background writer statistics; also checkpoint statistics through PG 16 (moved to `pg_stat_checkpointer` in 17+)	Checkpoint frequency tuning
`pg_stat_io` (16+)	I/O statistics by backend type and object	I/O bottleneck analysis
`pg_stat_wal` (14+)	WAL generation statistics	WAL volume analysis
`pg_stat_statements`	Query-level statistics (calls, time, rows)	Top-N slow queries, query patterns
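A few starter queries against these views are sketched below. The `pg_stat_statements` query assumes the extension is installed and uses the column names introduced in PostgreSQL 13 (`total_exec_time`, `mean_exec_time`).

```sql
-- Blocked sessions and the PIDs blocking them.
SELECT pid,
       pg_blocking_pids(pid) AS blocked_by,
       wait_event_type,
       state,
       left(query, 80) AS query
FROM pg_stat_activity
WHERE cardinality(pg_blocking_pids(pid)) > 0;

-- Indexes with no scans since the last statistics reset.
-- Check that an index does not back a PRIMARY KEY or UNIQUE constraint before dropping it.
SELECT schemaname, relname, indexrelname,
       pg_size_pretty(pg_relation_size(indexrelid)) AS size
FROM pg_stat_user_indexes
WHERE idx_scan = 0
ORDER BY pg_relation_size(indexrelid) DESC;

-- Top 10 statements by total execution time.
SELECT calls,
       round(total_exec_time::numeric, 1) AS total_ms,
       round(mean_exec_time::numeric, 2) AS mean_ms,
       rows,
       left(query, 80) AS query
FROM pg_stat_statements
ORDER BY total_exec_time DESC
LIMIT 10;
```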

Version Routing

Version	Status	Key Feature	Route To
PostgreSQL 18	Current (Sep 2025)	Async I/O, UUIDv7, virtual generated columns, OAuth	`18/SKILL.md`
PostgreSQL 17	Supported	Incremental backup, JSON_TABLE, MERGE RETURNING	`17/SKILL.md`
PostgreSQL 16	Supported	Logical replication from standby, SQL/JSON constructors, pg_stat_io	`16/SKILL.md`
PostgreSQL 15	Supported	MERGE command, PUBLIC schema permission changes	`15/SKILL.md`
PostgreSQL 14	Supported (EOL Nov 2026)	Multirange types, pg_stat_wal	`14/SKILL.md`
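When the server version is not known up front, it can be read from the connection before choosing a version agent. A minimal sketch:

```sql
-- server_version_num is e.g. 170004 for 17.4; integer division yields the major version.
SELECT current_setting('server_version_num')::int / 10000 AS major_version,
       version() AS full_version;

-- Feature probe instead of a version check: does this server have pg_stat_io (16+)?
SELECT to_regclass('pg_catalog.pg_stat_io') IS NOT NULL AS has_pg_stat_io;
```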

PostgreSQL Technology Expert

When to Use This Agent vs. a Version Agent

How to Approach Tasks

Core Expertise

MVCC and Tuple Versioning

VACUUM and Autovacuum

WAL and Replication

Extension Ecosystem

JSONB

Full-Text Search

Partitioning (Declarative)

Connection Pooling (PgBouncer)

Query Optimization

EXPLAIN ANALYZE Interpretation

Planner-Influencing GUC Parameters

Index Types

pg_stat Views Overview

Common Pitfalls

Version Routing

Reference Files
