Archivo del skill

Code Archaeology

Name: Code Archaeology
Author: taylrfnt

Use when analyzing large, legacy, or undocumented codebases to build a navigable knowledge graph for context retrieval.

taylrfnt0 estrellas7 feb 2026

Ocupación
Categorías: Base de Conocimientos

Contenido de la habilidad

Analyze large, legacy, or undocumented codebases and produce a persistent knowledge graph. The graph enables fast, focused context retrieval in future sessions — any AI agent (Amp, Copilot, etc.) can query it to understand unfamiliar code without re-reading everything.

Workflow

Index the codebase → produces a knowledge graph under <repo>/archaeology/kg/
Query the graph → returns focused markdown context bundles
Use context bundles in coding sessions for navigation, refactoring, or onboarding

Indexing

Run scripts/index.py to build or update the knowledge graph.

python scripts/index.py <repo-root> [options]

Option	Default	Description
`--output-dir`

Skills relacionados

Code Archaeology | Skills Pool

python scripts/query_graph.py <kg-dir> [options]

Option	Default	Description
`--symbol <name>`	—	Find nodes matching a symbol name
`--path <glob>`	—	Filter by file path
`--tags <tag,...>`	—	Filter by tags (e.g., `god_object,hidden_io`)
`--hops <n>`	2	Max edge traversal depth from matched nodes
`--max-nodes <n>`	50	Cap on returned nodes
`--format`	`markdown`	Output format (`markdown` or `json`)

Type	Description
`file`	Source file
`module`	Language module / namespace
`package`	Package / crate / gem
`class`	Class or struct
`type`	Type alias, interface, protocol
`function`	Free function
`method`	Method on a class/type
`endpoint`	HTTP / RPC / GraphQL endpoint
`config`	Configuration key or block
`datastore`	Database, cache, queue
`event`	Event or message type
`job`	Background job / cron task
`test`	Test case or suite
`build_target`	Build rule or target
`external_service`	Third-party service dependency
`doc`	Documentation artifact

Edge	Meaning
`contains`	Parent structurally contains child
`defines`	File/module defines a symbol
`imports`	Source imports target
`calls`	Source invokes target
`implements`	Source implements target interface
`inherits`	Source extends target
`reads`	Source reads from datastore/config
`writes`	Source writes to datastore/config
`emits`	Source emits event
`consumes`	Source consumes event
`exposes`	Module exposes an endpoint
`uses_config`	Source references config key
`depends_on`	Build/deploy dependency
`tests`	Test covers target
`documents`	Doc documents target

{"file": "src/server.py", "start_line": 42, "end_line": 58}

archaeology/kg/
├── nodes.jsonl
├── edges.jsonl
├── files.jsonl
├── indexes/
│   ├── by_symbol.json
│   ├── by_path.json
│   └── by_tag.json
└── summaries/
    ├── <module>.md
    └── overview.md

Tag	Meaning
`god_object`	Class/module with excessive responsibilities
`feature_envy`	Entity that over-references another module's internals
`duplicate_logic`	Near-duplicate implementations across files
`hidden_io`	I/O buried inside business logic
`stringly_typed_config`	Config accessed via raw strings without validation
`shared_mutable_state`	Globals or shared state without synchronization
`temporal_coupling`	Operations that must happen in a specific undocumented order

Code Archaeology

Workflow

Indexing

Code Archaeology

Workflow

Indexing

Querying

Graph Structure

Node Types

Edge Types

Evidence Pointers

Output Structure

Agent-Driven Fallback

Traversal Strategy

Anti-Pattern Detection

Incremental Updates

Design Principles

Feishu Wiki

Gemini

Clawhub

Notion

Sherpa Onnx Tts

Openai Whisper Api