技能档案

Codex

Name: Codex
Author: spencer-life

This skill invokes Codex CLI (GPT-5.4) for web research, second opinions, code review, or autonomous terminal tasks. It should be used when the user says "use codex", "ask codex", "codex review", "codex research", "search the web for", or invokes /codex. Also auto-invoked by subagent-driven-development at phase boundaries.

spencer-life0 星标2026年4月16日

职业
分类: 命令行工具

技能内容

Codex CLI Integration

Invoke OpenAI Codex CLI (GPT-5.4) as a specialist tool. Claude stays the orchestrator; Codex is a power tool for specific strengths.

For Spencer's workflow, default to read-only verification first. Use Codex as an independent auditor before using it as an implementer.

Subcommands

Subcommand	When to use	Example
`exec`	General task delegation	`/codex fix the nginx 502 error`
`review`	Code review (second opinion)	`/codex review`
`adversarial-review`	Structured adversarial review (JSON with severity/confidence)	`/codex adversarial-review`
`research`	Web search / current events	`/codex research latest Node LTS`

相关技能

Codex | Skills Pool

Parse the user's intent — determine which subcommand fits:
- Web/current events → research
- Code review (free text) → review
- Code review (structured/automated triage) → adversarial-review
- Second opinion → compare
- Background job management → status / cancel / result
- Everything else → exec
Invoke the wrapper script via Bash tool:
```
bash ~/.claude/hooks/codex-delegate.sh <subcommand> "<prompt>" [flags]
```
Available flags:
- --timeout N — seconds (default 300)
- --sandbox MODE — read-only (default) or workspace-write
- --cwd DIR — working directory
- --output FILE — custom output path
- --worktree — run in isolated git worktree
- --search — enable live web search (for review, compare, exec)
- --background — fork to background, return job ID (use status/result to check)
Note: research always enables web search automatically. Use --search on other subcommands when Codex needs web access.
Present the result with clear attribution:
- Label output: Codex (GPT-5.4) says:
- For compare: present Codex's view, then Claude's view, then a Synthesis combining both
Handle errors gracefully:
- Exit 2 (timeout): "Codex timed out — here's my take instead: ..."
- Exit 3 (not installed): "Codex CLI isn't available. I'll handle this directly."
- Exit 1 (other error): "Codex hit an error. Here's what I think: ..."
Compare disagreement handling:
- If Claude and Codex agree: present consensus with both perspectives as support
- If they disagree: present both side-by-side with reasoning, recommend the stronger argument, let the user decide

Goal	Prompt shape	Shortcut
Independent review of Claude's work	`Use the claude-work-verifier skill...`	`just codex_verify "task summary"`
Evidence-first artifact check	`Use the artifact-verifier skill...`	`just codex_verify_artifacts "artifact summary"`
Audit a stale handoff before resume	`Use the handoff-auditor skill...`	`just codex_audit_handoff .claude/handoff.md`
Check ground-truth verification coverage	`Use the ground-truth-coverage skill...`	`just codex_ground_truth_audit "scope"`

{
  "verdict": "approve" | "needs-attention",
  "summary": "terse ship/no-ship assessment",
  "findings": [{
    "severity": "critical|high|medium|low",
    "title": "short title",
    "body": "what can go wrong and why",
    "file": "path/to/file",
    "line_start": 42,
    "line_end": 50,
    "confidence": 0.0-1.0,
    "recommendation": "concrete fix"
  }]
}

bash ~/.claude/hooks/codex-delegate.sh exec "fix the test" --background
# Returns: job ID

bash ~/.claude/hooks/codex-delegate.sh status
# Lists all jobs with status

bash ~/.claude/hooks/codex-delegate.sh result <job-id>
# Returns output from completed job

bash ~/.claude/hooks/codex-delegate.sh cancel <job-id>
# Kills a running job

Review the changes for this task:

TASK: {task description}
DIFF: {git diff output}
{If data task:} VERIFICATION: {pass/fail results from data-verify}

Focus on:
1. Does the implementation achieve the stated goal?
2. Are there edge cases or bugs the reviews might have missed?
3. For database changes: do the queries and data look correct?
4. What could break in production?

Give your independent assessment. Don't just say "looks good."

bash ~/.claude/hooks/codex-delegate.sh research "latest React 19 features" --timeout 60

bash ~/.claude/hooks/codex-delegate.sh review "" --search --cwd "$(pwd)"

bash ~/.claude/hooks/codex-delegate.sh compare "Review the architecture — is the service layer properly separated?" --search --cwd "$(pwd)"

bash ~/.claude/hooks/codex-delegate.sh exec "Fix the flaky test in auth.test.ts" --sandbox workspace-write --cwd "$(pwd)"

Codex

Codex CLI Integration

Subcommands

Codex

Codex CLI Integration

Subcommands

Steps

Preferred Verifier Flows

Adversarial Review

Background Jobs

Auto-Invocation (by subagent-driven-development)

Cost Awareness

Examples

Justfile Shortcuts

Oracle

Blucli

Peekaboo

Add Dock Band

Add Fallback Commands

Add Adaptive Card Form