Set up GBrain with auto-provisioned Supabase or PGLite, AGENTS.md injection, and a first import
Set up GBrain from scratch. Target: working brain in under 5 minutes.
Success criteria:
- gbrain doctor --json reports all checks OK.
- ~/.gbrain/update-state.json is written so future upgrades know what the user adopted or declined.

Install: bun add github:garrytan/gbrain
GBrain connects directly to Postgres over the wire protocol, NOT through the
Supabase REST API. You need the database connection string (a postgresql:// URI),
not the project URL or anon key. The password is embedded in the connection string.
Use the Shared Pooler connection string (port 6543), not the direct connection (port 5432). The direct hostname resolves to IPv6 only, which many environments can't reach. To find it:
1. Open the project and click Get Connected next to the project URL.
2. Choose Direct Connection String > Session Pooler.
3. Copy the Shared Pooler connection string.
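A quick way to pre-flight a pasted connection string before running init. A minimal sketch; `check_url` is a hypothetical helper and the host patterns are assumptions based on typical Supabase URL shapes, not gbrain behavior:

```shell
# check_url: warn when a connection string looks like the IPv6-only direct
# connection instead of the Shared Pooler (hypothetical helper; the host
# patterns below are assumptions, not part of gbrain).
check_url() {
  case "$1" in
    *pooler.supabase.com:6543*) echo "ok: Shared Pooler (port 6543)" ;;
    *.supabase.co:5432*)        echo "warn: direct connection (IPv6 only); use the Session Pooler string" ;;
    *)                          echo "unknown host pattern; verify manually" ;;
  esac
}

check_url "postgresql://postgres.ref:pw@aws-0-us-east-1.pooler.supabase.com:6543/postgres"
# → ok: Shared Pooler (port 6543)
```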
Do NOT ask for the Supabase anon key. GBrain doesn't use it.
Supabase gives you managed Postgres + pgvector (vector search built in) for $25/mo:
- gbrain init --supabase -- interactive wizard (prompts for connection string)
- gbrain init --url <connection_string> -- direct, no prompts
- gbrain init --non-interactive --url <connection_string> -- for scripts/agents
- gbrain doctor --json -- health check after init

There is no --local, --sqlite, or offline mode. GBrain requires Postgres + pgvector.
Guide the user through creating a Supabase project:
gbrain init --non-interactive --url "postgresql://postgres.[ref]:[password]@aws-0-[region].pooler.supabase.com:6543/postgres"
gbrain doctor --json

OpenClaw/Hermes agent instructions: store the Supabase access token in your persistent
env as SUPABASE_ACCESS_TOKEN. gbrain doesn't store it, and you'll need it for future
gbrain doctor runs. Generate one at: https://supabase.com/dashboard/account/tokens
If the user already has Postgres with pgvector:
gbrain init --non-interactive --url "<connection_string>"
gbrain doctor --json

If the connection fails with ECONNREFUSED and the URL contains supabase.co,
the user probably pasted the direct connection string (IPv6 only). Guide them to the
Session Pooler string instead (see Phase A step 4).
echo "=== GBrain Environment Discovery ==="
# Note: a redirection is not valid inside a for word list; unmatched globs
# stay literal and are skipped by the -d test, so no 2>/dev/null is needed.
for dir in /data/* ~/git/* ~/Documents/*; do
  if [ -d "$dir/.git" ]; then
    md_count=$(find "$dir" -name "*.md" -not -path "*/node_modules/*" -not -path "*/.git/*" 2>/dev/null | wc -l | tr -d ' ')
    if [ "$md_count" -gt 10 ]; then
      total_size=$(du -sh "$dir" 2>/dev/null | cut -f1)
      echo "  $dir ($total_size, $md_count .md files)"
    fi
  fi
done
echo "=== Discovery Complete ==="
Import the best candidate. For large imports (>1000 files), use nohup to survive session timeouts:
nohup gbrain import <dir> --no-embed --workers 4 > /tmp/gbrain-import.log 2>&1 &
Then check progress: tail -1 /tmp/gbrain-import.log
For smaller imports, run directly:
gbrain import <dir> --no-embed
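If you want the agent to pick the right mode automatically, a file-count gate like this works. A sketch; `import_mode` is a hypothetical helper, and only the 1000-file threshold comes from the guidance above:

```shell
# import_mode: decide between a direct import and a nohup background import
# based on the .md file count (illustrative helper, not a gbrain command).
import_mode() {
  count=$(find "$1" -name "*.md" -not -path "*/node_modules/*" -not -path "*/.git/*" 2>/dev/null | wc -l | tr -d ' ')
  if [ "$count" -gt 1000 ]; then
    echo "nohup"    # large import: survive session timeouts
  else
    echo "direct"   # small import: run in the foreground
  fi
}
```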
Prove search works. Pick a semantic query based on what you imported:
gbrain search "<topic from the imported data>"
This is the magical moment: the user sees search finding things grep couldn't.
Start embeddings. Refresh stale embeddings with gbrain embed --stale (runs in the background). Keyword search works NOW; semantic search improves as embeddings complete.
Backfill the knowledge graph. Populate typed links and structured timeline from the imported pages. Auto-link maintains both going forward, but historical pages need a one-time backfill.
gbrain extract links --source db --dry-run | head -20 # preview
gbrain extract links --source db # commit
gbrain extract timeline --source db # dated events
gbrain stats # verify links > 0
After this, gbrain graph-query <slug> --depth 2 works and search ranks
well-connected entities higher. Idempotent — safe to re-run anytime.
Supports --since YYYY-MM-DD for incremental runs on huge brains.
Skip if Phase C imported zero pages (auto-link handles new writes).
Offer file migration. If the repo has binary files (.raw/ directories with images, PDFs, audio):
"You have N binary files (X GB) in your brain repo. Want to move them to cloud storage? Your git repo will drop from X GB to Y MB. All links keep working."
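To fill in the N and X numbers for that pitch, something like this works. A sketch; these are illustrative helpers, not gbrain commands, and they assume binaries live under `.raw/` directories as described above:

```shell
# raw_size_mb: total size (MB) of all .raw/ directories under a brain repo.
raw_size_mb() {
  find "$1" -type d -name ".raw" -exec du -sk {} + 2>/dev/null |
    awk '{kb += $1} END {printf "%.1f\n", kb/1024}'
}

# raw_file_count: number of binary files inside .raw/ directories.
raw_file_count() {
  find "$1" -path "*/.raw/*" -type f 2>/dev/null | wc -l | tr -d ' '
}

# Usage: raw_file_count /data/brain; raw_size_mb /data/brain
```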
If the user agrees, configure storage and run migration:
# Configure storage backend (Supabase Storage recommended)
gbrain config set storage.backend supabase
gbrain config set storage.bucket brain-files
gbrain config set storage.projectUrl <supabase-url>
gbrain config set storage.serviceRoleKey <service-role-key>
# Migrate binary files to cloud (3-step lifecycle)
gbrain files mirror <brain-dir> # Upload to cloud, keep local
gbrain files redirect <brain-dir> # Replace local with .redirect.yaml pointers
# (optional) gbrain files clean <brain-dir> --yes # Remove pointers too
After migration, gbrain files upload-raw handles new files automatically:
small text/PDFs stay in git, large/media files go to cloud with .redirect.yaml
pointers. Files >= 100 MB use TUS resumable upload for reliability.
If no markdown repos are found, create a starter brain with a few template pages (a person page, a company page, a concept page) from docs/GBRAIN_RECOMMENDED_SCHEMA.md.
Run the migration runner once, then install autopilot. Two commands, done:
gbrain apply-migrations --yes # applies any pending migrations; idempotent on healthy installs
gbrain autopilot --install # supervises itself + forks the Minions worker; env-aware
What gbrain autopilot --install does:
- macOS: installs ~/Library/LaunchAgents/com.gbrain.autopilot.plist.
- Linux: installs ~/.config/systemd/user/gbrain-autopilot.service with Restart=on-failure.
- Containers: writes ~/.gbrain/start-autopilot.sh and prints the one-liner your agent's bootstrap should source to launch autopilot on every container start.
- Auto-injects into OpenClaw's hooks/bootstrap/ensure-services.sh if detected (use --no-inject to opt out).

Autopilot then supervises the Minions worker as a child process. Users get
sync + extract + embed + backlinks + durable Postgres-backed job processing
from ONE install step. No separate gbrain jobs work daemon to manage.
On PGLite, autopilot runs inline (PGLite's exclusive file lock blocks a separate worker process). Everything else still works.
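On Linux, the generated user service would look something like this. An illustrative sketch only: gbrain autopilot --install writes the real file, and everything here beyond the unit path and Restart=on-failure is an assumption.

```ini
# ~/.config/systemd/user/gbrain-autopilot.service (illustrative)
[Unit]
Description=GBrain autopilot supervisor

[Service]
# Assumed entry point; the installer may use a different command
ExecStart=%h/.gbrain/start-autopilot.sh
Restart=on-failure

[Install]
WantedBy=default.target
```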
If apply-migrations prints "N host-specific items need your agent's
attention," read ~/.gbrain/migrations/pending-host-work.jsonl + walk
skills/migrations/v0.11.0.md + docs/guides/plugin-handlers.md to
register host-specific handlers. Re-run apply-migrations after each
batch.
Inject the brain-first lookup protocol into the project's AGENTS.md (or equivalent). This replaces grep-based knowledge lookups with structured gbrain queries.
| Task | Before (grep) | After (gbrain) |
|---|---|---|
| Find a person | grep -r "Pedro" brain/ | gbrain search "Pedro" |
| Understand a topic | grep -rl "deal" brain/ \| head -5 && cat ... | gbrain query "what's the status of the deal" |
| Read a known page | cat brain/people/pedro.md | gbrain get people/pedro |
| Find connections | grep -rl "Brex" brain/ \| xargs grep "Pedro" | gbrain query "Pedro Brex relationship" |
1. gbrain search "name" -- keyword match, fast, works without embeddings
2. gbrain query "what do we know about name" -- hybrid search, needs embeddings
3. gbrain get <slug> -- direct page read when you know the slug from steps 1-2
4. grep fallback -- only if gbrain returns zero results AND the file may exist outside the indexed brain

Stop at the first step that gives you what you need. Most lookups resolve at step 1.
After creating or updating any brain page in the repo, sync immediately so the index stays current:
gbrain sync --no-pull --no-embed
This indexes new/changed files without pulling from git or regenerating embeddings.
Embeddings can be refreshed later in batch (gbrain embed --stale).
| Layer | What it stores | When to use |
|---|---|---|
| gbrain | World knowledge: people, companies, deals, meetings, concepts, media | "Who is Pedro?", "What happened at the board meeting?" |
| memory_search | Agent operational state: preferences, decisions, session context | "How does the user like formatting?", "What did we decide about X?" |
Both should be checked. gbrain for facts about the world. memory_search for how the agent should behave.
Read docs/GBRAIN_SKILLPACK.md. This is the reference architecture for how a
production agent uses gbrain: the brain-agent loop, entity detection, enrichment
pipeline, meeting ingestion, cron schedules, and the five operational disciplines.
Inject the key patterns into the agent's system context or AGENTS.md:
[Source: ...]

Tell the user: "The production agent guide is at docs/GBRAIN_SKILLPACK.md. It covers the brain-agent loop, entity detection, enrichment, meeting ingestion, and cron schedules. Read it when you're ready to go from 'search works' to 'the brain maintains itself.'"
Run gbrain doctor --json and report the results. Every check should be OK.
If any check fails, the doctor output tells you exactly what's wrong and how to fix it.
If any gbrain command fails, run gbrain doctor --json first. Report the full
output. It checks connection, pgvector, RLS, schema version, and embeddings.
| What You See | Why | Fix |
|---|---|---|
| Connection refused | Supabase project paused, IPv6, or wrong URL | Use Session pooler (port 6543), or supabase.com/dashboard > Restore |
| Password authentication failed | Wrong password | Project Settings > Database > Reset password |
| pgvector not available | Extension not enabled | Run CREATE EXTENSION vector; in SQL Editor |
| OpenAI key invalid | Expired or wrong key | platform.openai.com/api-keys > Create new |
| No pages found | Query before import | Import files into gbrain first |
| RLS not enabled | Security gap | Run gbrain init again (auto-enables RLS) |
If the user's install did NOT include setting up auto-update checks (e.g., they used the manual install path or an older version of the OpenClaw/Hermes paste), offer it:
"Would you like daily GBrain update checks? I'll let you know when there's a new version worth upgrading to — including new skills and schema recommendations. You'll always be asked before anything is installed."
If they agree:
gbrain check-update --json

If already configured or the user declines, skip.
The brain repo is the source of truth. If sync doesn't run automatically, the vector DB falls behind and gbrain returns stale answers. This phase is not optional.
Read docs/GBRAIN_SKILLPACK.md Section 18 for the full reference. Key points:
Check the connection pooler first. Sync uses transactions on every import.
If DATABASE_URL uses Supabase's Transaction mode pooler, sync will throw
.begin() is not a function and silently skip most pages. Verify the connection
string uses Session mode (port 6543) or the direct connection (port 5432).
Set up automatic sync. Choose the approach that fits your environment:
- Cron: gbrain sync --repo /data/brain && gbrain embed --stale
- Watcher: run gbrain sync --watch --repo /data/brain under a process manager. Pair with a cron fallback (watch exits after 5 consecutive failures).

Verify sync works. Don't just check that the command ran. Check that it worked:
- gbrain stats should show a page count close to the syncable file count in the repo.
- Recent changes should be findable via gbrain search.

Chain sync + embed. Always run both: gbrain sync --repo <path> && gbrain embed --stale. For small syncs, embeddings are generated inline. The embed --stale is a safety net for any stale chunks.
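A cron fallback entry might look like this. Illustrative only; the repo path, log path, and 15-minute cadence are assumptions:

```
# crontab -e (user crontab): sync every 15 minutes, embed backfill as a safety net
*/15 * * * * gbrain sync --repo /data/brain && gbrain embed --stale >> /tmp/gbrain-sync.log 2>&1
```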
Tell the user: "Live sync is configured. The brain will stay current automatically. I'll verify it's working in the next phase."
Run the full verification runbook to confirm the entire installation is working.
docs/GBRAIN_VERIFY.md

Every check in the runbook should pass. The most important one is check 4 (live sync actually works): push a change, wait for sync, search for the corrected text. "Sync ran" is not the same as "sync worked."
Tell the user: "I've verified the full GBrain installation. Here's the status of each check: [list results]. Everything is working / [specific item] needs attention."
After presenting the recommended directories (Phase C/E) and the user selects which
ones to create, write ~/.gbrain/update-state.json recording:
- schema_version_applied: current gbrain version
- skillpack_version_applied: current gbrain version
- schema_choices.adopted: directories the user created
- schema_choices.declined: directories the user explicitly skipped
- schema_choices.custom: directories the user added that aren't in the recommended schema

This file enables future upgrades to suggest new schema additions without re-suggesting things the user already declined.
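For example (illustrative values only; the version string and directory names are made up):

```json
{
  "schema_version_applied": "0.11.0",
  "skillpack_version_applied": "0.11.0",
  "schema_choices": {
    "adopted": ["people", "companies"],
    "declined": ["media"],
    "custom": ["recipes"]
  }
}
```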
Reminder: Supabase's Transaction mode pooler causes .begin() is not a function errors and silently skips pages. Always use Session mode (port 6543).

GBRAIN SETUP COMPLETE
=====================
Engine: [PGLite / Supabase Postgres]
Connection: [verified / pooler mode confirmed]
Pages imported: N
Embeddings: N/N (keyword search active, semantic improving)
Live sync: [configured / method]
Health check: all OK / [specific failures]
Verification: [GBRAIN_VERIFY.md results]
Next steps:
- Read docs/GBRAIN_SKILLPACK.md for production agent patterns
- [any pending items]
- gbrain init --non-interactive --url ... -- create brain
- gbrain import <dir> --no-embed [--workers N] -- import files
- gbrain search <query> -- search brain
- gbrain doctor --json -- health check
- gbrain check-update --json -- check for updates
- gbrain embed refresh -- generate embeddings
- gbrain embed --stale -- backfill missing embeddings
- gbrain sync --repo <path> -- one-shot sync from brain repo
- gbrain sync --watch --repo <path> -- continuous sync polling
- gbrain config get sync.last_run -- check last sync timestamp
- gbrain stats -- page count + embed coverage