Codex shell compatibility:

- When shell steps call the GPD CLI, use `/home/qol/.gpd/venv/bin/python -m gpd.runtime_cli --runtime codex --config-dir ./.codex --install-scope local` instead of the ambient `gpd` on `PATH`.
- If you intentionally need the repo environment, keep the runtime pin: `GPD_ACTIVE_RUNTIME=codex uv run gpd ...`.
</codex_runtime_notes>
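The pinned invocation above is long, so shell steps may find it convenient to wrap it once. The following is a minimal sketch, not part of GPD: the helper name `gpd_cli`, the `GPD_PYTHON` override, and the `GPD_DRY_RUN` switch are all assumptions introduced here for illustration. Only the interpreter path and the flags are taken from the note above.

```shell
# Hypothetical wrapper (not shipped with GPD). The path and flags are copied
# from the compatibility note; GPD_PYTHON and GPD_DRY_RUN are assumptions.
GPD_PYTHON="${GPD_PYTHON:-/home/qol/.gpd/venv/bin/python}"

gpd_cli() {
  # With GPD_DRY_RUN=1, print the command line instead of executing it.
  if [ "${GPD_DRY_RUN:-0}" = "1" ]; then
    echo "$GPD_PYTHON -m gpd.runtime_cli --runtime codex --config-dir ./.codex --install-scope local $*"
    return 0
  fi
  "$GPD_PYTHON" -m gpd.runtime_cli --runtime codex --config-dir ./.codex \
    --install-scope local "$@"
}
```

A later step could then call `gpd_cli json get .research_content` and still go through the pinned runtime rather than whatever `gpd` happens to be on `PATH`.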

<objective> Create executable phase prompts (PLAN.md files) for a research-project phase with integrated research and verification.

Default flow: Research (if needed) -> Plan -> Verify -> Done

Planning scope: Each plan should address:

- Mathematical approach -- formalism, representation, notation conventions
- Computational strategy -- algorithms, numerical methods, computational tools
- Validation plan -- analytic limits, symmetry checks, benchmark comparisons
- Approximation scheme -- what is being neglected, regime of validity, error estimates

Parse arguments, validate phase, research domain (unless skipped), spawn gpd-planner, verify with gpd-plan-checker, iterate until pass or max iterations, present results. </objective>

<purpose>
Create executable phase prompts (PLAN.md files) for a research phase with integrated literature review and verification. Default flow: Research (if needed) -> Plan -> Verify -> Done. Orchestrates gpd-phase-researcher, gpd-planner, and gpd-plan-checker agents with a revision loop (max 3 iterations).
</purpose>

<process> </process>

<context>
Phase number: $ARGUMENTS (optional -- auto-detects next unplanned phase if omitted)

<process>
**CRITICAL: First, read the full workflow file using the read_file tool:** Read the file at ./.codex/get-physics-done/workflows/plan-phase.md, which contains the complete step-by-step instructions. Do NOT improvise. Follow the workflow file exactly.

    if [ "$RESEARCH_MODE" = "explore" ]; then
      # Explore: always re-research for broader coverage
      echo "Research mode: explore -- re-researching for comprehensive coverage"
      # Proceed to spawn researcher below
    elif [ "$RESEARCH_MODE" = "exploit" ]; then
      # Exploit: reuse only if existing research already covers the exact method family
      # and contract-critical anchor/comparison path for this phase
      if echo "$INIT" | /home/qol/.gpd/venv/bin/python -m gpd.runtime_cli --runtime codex --config-dir ./.codex --install-scope local json get .research_content --default "" | grep -qi "method\\|benchmark\\|anchor"; then
        echo "Research mode: exploit -- existing targeted research appears sufficient"
        # Skip to step 6
      else
        echo "Research mode: exploit -- existing research is too generic, refreshing targeted method context"
        # Proceed to spawn researcher below
      fi
    elif [ "$RESEARCH_MODE" = "adaptive" ]; then
      # Adaptive: narrow only after prior decisive evidence or an explicit approach lock
      VALIDATED=$(ls .gpd/phases/*/*-SUMMARY.md 2>/dev/null | xargs grep -El "approach_validated: true|comparison_verdicts:|contract_results:" 2>/dev/null | head -1)
      if [ -n "$VALIDATED" ]; then
        echo "Research mode: adaptive -- prior decisive evidence found, using existing research as the starting point"
        # Skip to step 6
      else
        echo "Research mode: adaptive -- approach not yet locked, refreshing research before planning"
        # Proceed to spawn researcher below
      fi
    else
      # Balanced (default): check staleness before skipping
      # Try GNU `stat -c %Y` first, then BSD/macOS `stat -f %m`. On GNU systems
      # `stat -f` means --file-system and prints filesystem info to stdout before
      # failing, which would pollute the captured value if it were tried first.
      RESEARCH_MOD=$(stat -c %Y "${PHASE_DIR}"/*-RESEARCH.md 2>/dev/null || stat -f %m "${PHASE_DIR}"/*-RESEARCH.md 2>/dev/null || echo 0)
      STATE_MOD=$(stat -c %Y .gpd/STATE.md 2>/dev/null || stat -f %m .gpd/STATE.md 2>/dev/null || echo 0)
      DIFF_DAYS=$(( (STATE_MOD - RESEARCH_MOD) / 86400 ))
      if [ "$DIFF_DAYS" -gt 1 ]; then
        echo "Research may be stale (created ${RESEARCH_MOD}, state updated ${STATE_MOD}). Re-research with --research?"
        # If user chooses to re-research, proceed to spawn researcher below.
        # Otherwise, use existing and skip to step 6.
      fi
    fi
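The staleness test in the balanced branch relies on shell integer arithmetic, which truncates. A short worked sketch of that arithmetic follows; the helper name `diff_days` and the timestamps are illustrative assumptions, not values from any real project.

```shell
# Hypothetical helper mirroring the DIFF_DAYS expression above:
# whole days between two Unix timestamps (seconds), state first, research second.
diff_days() {
  echo $(( ($1 - $2) / 86400 ))
}

# STATE.md modified 200000 s after RESEARCH.md: 200000 / 86400 truncates to 2,
# so the "-gt 1" test fires and the stale-research prompt is shown.
diff_days 1700200000 1700000000

# One second short of two full days: 172799 / 86400 truncates to 1,
# so the research is still treated as fresh.
diff_days 1700172799 1700000000
```

In practice this means the prompt appears only once STATE.md is at least two full days newer than RESEARCH.md.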

<objective>
Research how to approach Phase {phase_number}: {phase_name}

Answer: "What mathematical methods, physical principles, and computational tools do I need to PLAN this phase rigorously?"
</objective>

<phase_context>
IMPORTANT: If CONTEXT.md exists below, it contains user decisions from $gpd-discuss-phase.

- **Decisions** = Locked -- research THESE deeply, no alternatives
- **Agent's Discretion** = Freedom areas -- research options, recommend
- **Deferred Ideas** = Out of scope -- ignore

{context_content}
</phase_context>

<additional_context>
**Phase description:** {phase_description}
**Requirements:** {requirements}
**Prior decisions:** {decisions}
**Project contract:** {project_contract}
**Active references:** {active_reference_context}
**Reference artifacts:** {reference_artifacts_content}
</additional_context>

<research_mode>{RESEARCH_MODE}</research_mode>

<physics_research_focus>
**Research depth by mode:**

- **explore:** COMPREHENSIVE -- survey ALL viable methods, compare 3+ approaches, include failed approaches from literature, broad literature search (10+ papers), identify unexplored angles
- **balanced** (default): STANDARD -- identify best approach, document known difficulties, targeted literature (5-7 key papers)
- **exploit:** MINIMAL -- method-specific details only (parameters, convergence criteria, implementation notes). Skip broad survey. Only papers directly relevant to the exact computation.
- **adaptive:** Use explore-style until prior decisive evidence or an explicit approach lock shows the method family is stable. Then narrow to a balanced or exploit-style pass for the locked method.

**Core research areas (all modes):**

- **Mathematical framework:** Identify the governing equations, symmetry groups, relevant Hilbert spaces, or variational principles
- **Known solutions:** Find exact solutions, standard approximations (perturbative, WKB, mean-field), and their regimes of validity
- **Limiting cases:** Identify all limiting cases that must be recovered (classical limit, weak-coupling, non-relativistic, thermodynamic limit, etc.)
- **Computational methods:** Survey numerical approaches (finite element, Monte Carlo, spectral methods) and existing packages
- **Literature:** Key papers, textbook treatments, and review articles relevant to this phase
- **Dimensional analysis:** Identify natural scales and dimensionless parameters that govern the physics
</physics_research_focus>

<output>
Write to: {phase_dir}/{phase}-RESEARCH.md
</output>

<planning_context>
**Phase:** {phase_number}
**Mode:** {standard | gap_closure}
**Plan depth:** {full | light}
**Research mode:** {RESEARCH_MODE}
**Autonomy:** {AUTONOMY}

Planning requires an approved project contract. If `{project_contract}` is empty, stale, or too underspecified to identify the phase contract slice, return `## CHECKPOINT REACHED` instead of writing or revising plans from inferred scope.

**Project State:** {state_content}
**Project Contract:** {project_contract}
**Roadmap:** {roadmap_content}
**Requirements:** {requirements_content}
**Protocol Bundles:** {protocol_bundle_context}
**Active References:** {active_reference_context}
**Reference Artifacts:** {reference_artifacts_content}

**Phase Context:** IMPORTANT: If context exists below, it contains USER DECISIONS from $gpd-discuss-phase.

- **Decisions** = LOCKED -- honor exactly, do not revisit
- **Agent's Discretion** = Freedom -- make methodological choices
- **Deferred Ideas** = Out of scope -- do NOT include

{context_content}

**Research:** {research_content}
**Experiment Design (if exists):** {experiment_design_content}
**Gap Closure (if --gaps):** {verification_content} {validation_content}
</planning_context>

<physics_planning_requirements>
Each plan MUST include:

- **Mathematical rigor checkpoints:** Points where derivations must be verified for dimensional consistency, symmetry preservation, and correct tensor structure
- **Limiting case validation:** Explicit checks that results reduce correctly in all known limits (classical, non-relativistic, weak-coupling, thermodynamic, etc.)
- **Order-of-magnitude estimates:** Before any detailed calculation, estimate the expected scale of the answer
- **Error budget:** For numerical work, specify target precision and identify dominant error sources
- **Consistency checks:** Cross-checks between independent methods or approaches where possible
- **Anchor discipline:** If a benchmark, paper, dataset, or prior artifact is contract-critical, surface it in the plan instead of treating it as optional background
- **Contract completeness:** Every plan must include claims, deliverables, references, acceptance tests, forbidden proxies, and uncertainty markers in frontmatter
- **Protocol bundle coverage:** If protocol bundles are selected, carry their estimator policies, decisive artifact guidance, and verifier extensions into the plan explicitly
</physics_planning_requirements>

<contract_requirements>
Planning requires `project_contract`:

- If `project_contract` is empty, stale, or too underspecified to identify the phase contract slice, return `## CHECKPOINT REACHED` instead of writing a weak or guessed plan.
- Every PLAN.md must include a `contract` frontmatter block with exact IDs for claims, deliverables, references, acceptance tests, and forbidden proxies.
- Every PLAN.md must carry forward required context from the contract: must-read refs, prior outputs, baselines, and user anchors when execution depends on them.
- Every PLAN.md must include uncertainty markers from the contract when they constrain interpretation or verification.
- Every PLAN.md should express result wiring through `contract.links` or explicit task/verification handoffs, not through a second ad hoc success schema.
- Validate each finished plan with `gpd validate plan-contract <PLAN.md>` before treating it as approved.
- Autonomy mode and model profile may change cadence or detail, but they do NOT relax contract completeness.
</contract_requirements>

<light_mode_instructions>
**If plan depth is `light`:** Keep the full canonical frontmatter, including `wave`, `depends_on`, `files_modified`, `interactive`, `conventions`, and `contract`. Simplify only the body: one high-level task block per plan, concise verification, concise success criteria. The light plan is a shorter execution script, not a strategic outline that drops required contract fields.
</light_mode_instructions>

<context_budget_guidance>
Context windows are finite (~200k tokens, ~80% usable). Plans must be sized accordingly:

- **Target per plan:** ~50% context budget (40% for hypothesis-driven plans)
- **Segment large phases** into multiple plans rather than one overloaded plan
- **Flag context-heavy plans** in frontmatter: `context_note: "Heavy - consider splitting if >6 tasks"`
- **Group related tasks** that share intermediate results in the same plan
- **Use waves** for independent work -- each subagent gets a fresh context window

**Signs a plan needs splitting:** >6-8 substantive tasks, multiple independent derivations, tasks requiring different large reference files, mix of symbolic derivation and numerical verification.

See `./.codex/get-physics-done/references/orchestration/context-budget.md` for detailed budget allocation by workflow type.
</context_budget_guidance>

<downstream_consumer>
Output consumed by $gpd-execute-phase. Plans need:

- Frontmatter (wave, depends_on, files_modified, interactive, contract)
- Tasks in XML format
- Verification criteria with mathematical rigor requirements
- contract-complete frontmatter before execution starts
- contract links or explicit task-level dependency wiring for critical handoffs, including limiting-case checks
- protocol-bundle guidance reflected in task structure, verification, and decisive artifact selection when applicable
</downstream_consumer>

<quality_gate>
- [ ] PLAN.md files created in phase directory
- [ ] Each plan has valid frontmatter
- [ ] Each plan has a complete contract block (claims, deliverables, references, acceptance tests, forbidden proxies, uncertainty markers)
- [ ] Each plan passes `gpd validate plan-contract <PLAN.md>`
- [ ] Tasks are specific and actionable with clear mathematical deliverables
- [ ] Dependencies correctly identified (including prerequisite derivations)
- [ ] Waves assigned for parallel execution
- [ ] Contract links or explicit task-level dependency wiring cover the decisive handoffs and limiting-case recovery path
- [ ] Required refs, prior outputs, and baselines are surfaced in `<context>` or verification paths
- [ ] Selected protocol bundles are reflected in verification paths or decisive artifact choices where relevant
- [ ] Forbidden proxies are rejected explicitly in `<done>` or `<success_criteria>`
- [ ] Dimensional analysis check specified for each quantitative result
- [ ] Validation checkpoints placed after each major derivation step
</quality_gate>
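The quality gate asks that every plan pass `gpd validate plan-contract <PLAN.md>`. One way to apply that to a whole phase directory is sketched below. The helper name `validate_phase_plans`, the `*-PLAN.md` glob, and the `VALIDATE_CMD` override are assumptions of this sketch; only the validator command itself comes from the requirements above. Under Codex, `VALIDATE_CMD` would presumably be set to the runtime-pinned invocation rather than the ambient `gpd`.

```shell
# Hypothetical helper: run the contract validator over each plan in a phase directory.
# Assumptions: plans match *-PLAN.md; VALIDATE_CMD may override the validator command.
validate_phase_plans() {
  phase_dir="$1"
  validator="${VALIDATE_CMD:-gpd validate plan-contract}"
  failed=0
  for plan in "$phase_dir"/*-PLAN.md; do
    [ -e "$plan" ] || continue        # the glob matched nothing
    # $validator is left unquoted on purpose so it splits into command + arguments.
    if $validator "$plan"; then
      echo "PASS $plan"
    else
      echo "FAIL $plan"
      failed=1
    fi
  done
  return "$failed"
}
```

The function returns non-zero if any plan fails, so an orchestration step can refuse to mark the phase as approved. Note that an empty directory returns zero, so a separate check that plans exist is still needed.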

<verification_context>
**Phase:** {phase_number}
**Phase Goal:** {goal from ROADMAP}

**Plans to verify:** {plans_content}

**Requirements:** {requirements_content}
**Project Contract:** {project_contract}
**Protocol Bundles:** {protocol_bundle_context}
**Active References:** {active_reference_context}
**Reference Artifacts:** {reference_artifacts_content}

**Phase Context:** IMPORTANT: Plans MUST honor user decisions. Flag as issue if plans contradict.

- **Decisions** = LOCKED -- plans must implement exactly
- **Agent's Discretion** = Freedom areas -- plans can choose approach
- **Deferred Ideas** = Out of scope -- plans must NOT include

{context_content}
</verification_context>

<physics_verification_criteria>
In addition to structural checks, verify:

- [ ] **Dimensional consistency:** All equations are dimensionally correct
- [ ] **Limiting cases specified:** Plans identify which limits must be recovered and where checks occur
- [ ] **Approximation validity:** Each approximation has stated regime of validity and error estimates
- [ ] **Conservation laws:** Plans respect relevant conservation laws (energy, momentum, charge, unitarity, etc.)
- [ ] **Symmetry preservation:** Approximations and numerical methods preserve relevant symmetries
- [ ] **Independent cross-checks:** At least one independent verification method per major result
- [ ] **Order-of-magnitude sanity:** Expected scales are stated before detailed calculations
- [ ] **Anchor coverage:** Required references, baselines, and prior outputs are surfaced where the plan depends on them
- [ ] **Protocol-bundle coverage:** Selected protocol bundles are reflected in task structure, estimator guards, decisive artifacts, or verification paths
- [ ] **Contract completeness:** Each plan includes decisive claims, deliverables, acceptance tests, forbidden proxies, and uncertainty markers
- [ ] **Decisive outputs:** The plan set covers decisive claims and deliverables rather than only infrastructure or proxy work
- [ ] **Acceptance tests:** Every decisive claim or deliverable has at least one executable or reviewable test
- [ ] **Disconfirming path:** Risky plans name the observation or comparison that would force a rethink
- [ ] **Forbidden proxies:** Proxy-only success conditions are rejected explicitly
</physics_verification_criteria>

<expected_output>
- ## VERIFICATION PASSED -- all checks pass
- ## ISSUES FOUND -- structured issue list
- ## PARTIAL APPROVAL -- some plans approved, others need revision (see partial_approval protocol in your agent instructions)
</expected_output>
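The three headings in the expected output give the orchestrator something concrete to branch on. A minimal sketch of that classification follows, assuming the checker emits one of the headings at the start of a line. The helper name `classify_verdict`, the one-word return values, and the `unknown` fallback are assumptions introduced here.

```shell
# Hypothetical helper: read checker output on stdin and print a one-word verdict.
# The headings are the ones listed above; everything else is an assumption.
classify_verdict() {
  report="$(cat)"
  if printf '%s\n' "$report" | grep -q '^## VERIFICATION PASSED'; then
    echo passed
  elif printf '%s\n' "$report" | grep -q '^## PARTIAL APPROVAL'; then
    echo partial
  elif printf '%s\n' "$report" | grep -q '^## ISSUES FOUND'; then
    echo issues
  else
    echo unknown
  fi
}
```

A revision loop could call this after each checker run and send `issues` or `partial` back to the planner, stopping on `passed` or once the three-iteration limit is reached.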

| Mode | RESEARCH.md exists | RESEARCH.md missing | `--research` flag |
|------|--------------------|---------------------|-------------------|
| explore | Re-research always (expand scope, compare alternatives, refresh anchors) | Research (comprehensive -- multiple methods, broad survey) | Research (comprehensive) |
| balanced (default) | Skip by default, but re-research if inputs look stale or missing for the current contract slice | Research (standard) | Research (standard) |
| exploit | Skip only if the existing research already covers the exact method family, anchor set, and decisive evidence path; otherwise run targeted method research | Research (minimal -- method-specific only, no broad survey) | Research (minimal) |
| adaptive | Reuse existing research only after prior decisive evidence or explicit approach-lock markers show the method is stable; otherwise refresh research in a balanced or explore-style pass | Research (broad enough to choose and lock an approach) | Research (standard) |
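The last column of the matrix is a plain lookup from mode to research depth when `--research` is passed. It can be transcribed as a `case` statement; the helper name `forced_research_depth` is hypothetical, and treating any unrecognized mode as balanced is an assumption based on balanced being the stated default.

```shell
# Hypothetical helper: research depth when --research is passed,
# transcribed from the last column of the matrix above.
forced_research_depth() {
  case "$1" in
    explore)  echo comprehensive ;;
    exploit)  echo minimal ;;
    adaptive) echo standard ;;
    *)        echo standard ;;   # balanced, assumed as the fallback
  esac
}
```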

| Mode | Experiment Designer | Rationale |
|------|---------------------|-----------|
| explore | Always spawn (even if numerical indicators are weak) | Broad exploration benefits from structured experiment design |
| balanced | Spawn if numerical indicators detected (default behavior) | Standard heuristic |
| exploit | Skip unless EXPERIMENT-DESIGN.md is explicitly required by CONTEXT.md | Exploit mode minimizes overhead |
| adaptive | Follow balanced behavior until prior decisive evidence or an explicit approach lock stabilizes the method family; then reuse validated experiment templates for the locked approach | Evidence-driven reuse once the method is stable |
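Read as a decision procedure, the table above depends on three inputs besides the mode. The sketch below is one possible reading, not the workflow's actual implementation: the helper name `designer_action`, the 0/1 flag encoding, and the output words are assumptions, and "skip" for balanced mode without numerical indicators is inferred from the table rather than stated in it.

```shell
# Hypothetical helper transcribing the table above.
#   $1 = research mode
#   $2 = 1 if numerical indicators were detected, else 0
#   $3 = 1 if CONTEXT.md explicitly requires EXPERIMENT-DESIGN.md, else 0
#   $4 = 1 if the approach is locked (used in adaptive mode only), else 0
designer_action() {
  mode="$1"; numerical="${2:-0}"; required="${3:-0}"; locked="${4:-0}"
  case "$mode" in
    explore) echo spawn ;;
    exploit) [ "$required" = "1" ] && echo spawn || echo skip ;;
    adaptive)
      if [ "$locked" = "1" ]; then
        echo reuse-templates
      elif [ "$numerical" = "1" ]; then
        echo spawn
      else
        echo skip
      fi ;;
    *) [ "$numerical" = "1" ] && echo spawn || echo skip ;;   # balanced (default)
  esac
}
```

For example, `designer_action exploit 1 0` prints `skip`, because exploit mode ignores numerical indicators unless CONTEXT.md asks for a design.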

| Wave | Plans | What it builds |
|------|-------|----------------|
| 1 | 01, 02 | [objectives] |
| 2 | 03 | [objective] |

| Domain keywords | Blueprint applied |
|-----------------|-------------------|
| amplitude, Feynman, loop, renormalization | QFT: diagrams → integrals → renormalization → observables |
| Hamiltonian, order parameter, phase diagram | Condensed matter: symmetries → mean-field → fluctuations → response |
| partition function, critical exponent, Ising | Statistical mechanics: parallel analytical + numerical |
| spacetime, metric, gravitational wave | GR/cosmology: gauge choice first → constraints throughout |
| atom-light, Rabi, cavity, detuning | AMO: rotating frame → RWA validity check → master equation |
| convergence, finite element, PDE | Numerical: mandatory convergence study → production |
| matching, Wilson coefficient, EFT | EFT: power counting first → operator basis → matching |
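The keyword table suggests a simple lookup from a phase description to a blueprint. The sketch below is one way to do it, under stated assumptions: the helper name `select_blueprint` is hypothetical, matching is case-insensitive and whole-word (`grep -w`, so "metric" does not fire inside "symmetric", but plurals such as "loops" do not match either), and the first matching row wins. The source does not say how the real workflow resolves overlaps.

```shell
# Hypothetical helper: pick a blueprint label from a phase description.
# Keywords and labels come from the table above; whole-word, case-insensitive
# matching and first-match-wins ordering are assumptions of this sketch.
select_blueprint() {
  desc="$1"
  has() { printf '%s\n' "$desc" | grep -Ewqi "$1"; }
  if   has 'amplitude|Feynman|loop|renormalization';      then echo "QFT"
  elif has 'Hamiltonian|order parameter|phase diagram';   then echo "Condensed matter"
  elif has 'partition function|critical exponent|Ising';  then echo "Statistical mechanics"
  elif has 'spacetime|metric|gravitational wave';         then echo "GR/cosmology"
  elif has 'atom-light|Rabi|cavity|detuning';             then echo "AMO"
  elif has 'convergence|finite element|PDE';              then echo "Numerical"
  elif has 'matching|Wilson coefficient|EFT';             then echo "EFT"
  else echo "none"
  fi
}
```

A description that mentions keywords from several rows gets only the first row's blueprint here; a fuller version might report every match and let the planner choose.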

Gpd Plan Phase

1. Initialize

2. Parse and Normalize Arguments

--inline-discuss Flag (Combined Discuss + Plan)

3. Validate Phase

4. Load CONTEXT.md and Hypothesis Context

Hypothesis-Aware Planning

4.5. Convention Verification

5. Handle Research

Research Mode Decision Matrix

Spawn gpd-phase-researcher

Handle Researcher Return

5.5. Experiment Design (Numerical/Computational Phases)

Spawn gpd-experiment-designer

Handle Experiment Designer Return

6. Check Existing Plans

7. Use Context Files from INIT

8. Spawn gpd-planner Agent

9. Handle Planner Return

10. Spawn gpd-plan-checker Agent

11. Handle Checker Return

12. Revision Loop (Max 3 Iterations)

13. Present Final Status

>> Next Up

Stage Banners

Checkpoint Boxes

Status Symbols

Progress Display

Spawning Indicators

Next Up Block

Error Box

Physics-Specific Display Elements

Tables

Anti-Patterns

What Makes a Good Physics Plan

Common Failure Modes

Quick Checklist Before Approving a Plan

Domain-Aware Planning
