Operational playbook for the DispatchPulse Meta PyTorch OpenEnv Hackathon submission. Load this when working in /Users/arunsanjay/Documents/Projects/DispatchPulse or when Arun mentions the hackathon, DispatchPulse, Round 1/Round 2, HF Space Arun-Sanjay/dispatchpulse, or the GitHub repo Arun-Sanjay/DispatchPulse. Covers how to test, debug, deploy, and respond to Phase 1/Phase 2 validator failures without breaking the existing passing submission.
Scope: This skill is the how-to companion to CLAUDE.md. CLAUDE.md tells you WHAT the project is. This skill tells you WHAT TO DO — step-by-step playbooks for every common scenario.
First action in any session: read CLAUDE.md at the project root. Then find your scenario in section 2 below and follow it. Do not improvise — every command here is tested.
cd /Users/arunsanjay/Documents/Projects/DispatchPulse
# Sanity check: working tree clean, on main, venv exists
git status
git log --oneline | head -5
git remote -v
ls .venv/bin/python
Expected state:
- On branch main, nothing to commit, working tree clean
- git log head: 82ce364 Fix Phase 2: add GET /tasks and POST /grader endpoints (or whatever the user has pushed since)
- Two remotes: origin (HF Space) and github (GitHub)
- .venv/bin/python exists and is Python 3.11

If any of those is wrong, stop and ask Arun what state he's expecting.
Quick smoke test that nothing is broken:
.venv/bin/python tests/test_reward.py && .venv/bin/python tests/test_simulation.py
# Expected: "All reward tests passed!" and "All simulation tests passed!"
If tests don't pass, do NOT start work. Ask Arun what changed.
Actions:
First step: get the exact failure reason from Arun. The Scaler email names one of these 5 checks:
Do NOT speculatively fix multiple checks at once. Fix exactly the one that failed, verify locally, push, resubmit. If a second check fails after the first is fixed, handle it then.
Most likely cause: the grader's docker build . ran out of time, couldn't pull the base image, or hit a pip install error with the git-installed openenv-core.
Debug checklist:
# 1. Does the Dockerfile still exist at the repo root?
ls -la Dockerfile
# 2. Is the base image pullable from GHCR?
# (only do this if Arun has Docker — skip otherwise)
docker pull ghcr.io/meta-pytorch/openenv-base:latest 2>&1 | tail -5
# 3. Is pyproject.toml's git-based openenv-core dep still working?
# Check by letting uv re-resolve:
.venv/bin/uv lock --upgrade-package openenv-core 2>&1 | tail -20
Possible fixes (in order of safety):
- Pin openenv-core to a different tag in pyproject.toml if the grader can't reach @v0.2.3. Look up the latest stable tag on GitHub first.
- Add a .dockerignore to exclude .venv/, __pycache__/, .git/, tests/ from the Docker context — faster builds, smaller context.
- Change the RUN uv sync step to use --frozen to force uv to use uv.lock exactly without re-resolving.
- Do NOT change the base image. Do NOT switch to a plain python:3.11-slim base — the grader expects openenv-base.
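If you go with the .dockerignore fix, a minimal version could look like this (a sketch — extend the entries to whatever else lives in the repo):

```
# .dockerignore — keep the Docker build context small
.venv/
.git/
tests/
**/__pycache__/
*.pyc
```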
Most likely causes:
- the from_docker_image() call hangs or errors out
- no [END] emission

Debug checklist:
# 1. Does inference.py run locally in the in-process fallback?
DISPATCHPULSE_TASK=easy .venv/bin/python inference.py 2>&1 | head -20
# Must print [START], [STEP]s, [END] with valid format
# 2. Does it handle missing HF_TOKEN gracefully?
unset HF_TOKEN API_KEY
DISPATCHPULSE_TASK=easy .venv/bin/python inference.py 2>&1 | grep -E '^\[(START|STEP|END)\]' | tail -5
# 3. Does it still emit [END] on exception?
DISPATCHPULSE_TASK=easy MODEL_NAME="definitely-not-a-real-model" .venv/bin/python inference.py 2>&1 | grep -E '^\[END\]'
Possible fixes:
- Wrap the env.step() call using asyncio.wait_for(..., timeout=30)
- Make sure the [END] line always fires (the current code already does this via finally — verify it's still intact)
- Lower MAX_STEPS from 60 to 40 to ensure the script always finishes in under 20 min even with slow LLMs
- For from_docker_image() specifically: look at the openenv-core LocalDockerProvider source in .venv/lib/python3.11/site-packages/openenv/core/containers/runtime/providers/ and mimic what a passing submission does

The grader couldn't parse our stdout. Check every byte:
# Capture a real run and inspect line by line
DISPATCHPULSE_TASK=easy .venv/bin/python inference.py > /tmp/out.log 2>&1
grep -E '^\[(START|STEP|END)\]' /tmp/out.log | cat -vet # shows hidden chars (BSD/macOS-safe; same idea as GNU cat -A)
Common format bugs:
- stray whitespace or extra text on a [STEP] line (shouldn't matter but sometimes graders are strict)
- True/False instead of true/false
- reward=0.5 instead of reward=0.50 (missing 2nd decimal)
- score=0.5 instead of score=0.500 (missing 3rd decimal)
- no [END] on exception paths

Check the inference.py log_start / log_step / log_end functions. Keep them simple f-strings.
Grader couldn't find 3 graded tasks. Our submission exposes them THREE ways:
- GET /tasks HTTP endpoint (in server/app.py)
- POST /grader HTTP endpoint (in server/app.py)
- tasks: list in the openenv.yaml manifest

Debug checklist:
# 1. Does /tasks return 3 tasks?
curl -sf https://arun-sanjay-dispatchpulse.hf.space/tasks | python3 -m json.tool | grep has_grader
# 2. Does /grader work for each task?
for t in easy medium hard; do
curl -sf -X POST https://arun-sanjay-dispatchpulse.hf.space/grader \
-H "Content-Type: application/json" -d "{\"task_id\":\"$t\",\"seed\":42}" \
| python3 -c "import sys,json; d=json.load(sys.stdin); print(f'{d[\"task_id\"]}: score={d[\"score\"]:.3f} passed={d[\"passed\"]}')"
done
# 3. Does openenv.yaml declare 3 tasks with has_grader: true?
grep -c "has_grader" openenv.yaml # should output 3
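For reference, the manifest shape that satisfies that grep count is roughly this. A sketch only — every field name here except tasks and has_grader is an assumption about the openenv.yaml schema, so compare against the real file before editing:

```yaml
tasks:
  - id: easy
    has_grader: true
  - id: medium
    has_grader: true
  - id: hard
    has_grader: true
```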
Fallback fix: create task_definitions.py at project root with a Python-level TASKS dict that mirrors the Calendar Scheduling passing submission pattern:
# task_definitions.py
from dataclasses import dataclass
@dataclass(frozen=True)
class TaskDefinition:
task_id: str
name: str
difficulty: str # "easy" | "medium" | "hard"
description: str
max_steps: int
TASKS = {
"easy": TaskDefinition(
task_id="easy", name="easy", difficulty="easy",
description="...", max_steps=30,
),
"medium": TaskDefinition(
task_id="medium", name="medium", difficulty="medium",
description="...", max_steps=45,
),
"hard": TaskDefinition(
task_id="hard", name="hard", difficulty="hard",
description="...", max_steps=60,
),
}
Then import it in server/app.py so static analyzers can trace the symbol.
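A sketch of that wiring — TaskDefinition is abbreviated here, and the real /tasks endpoint already exists in server/app.py, so this only illustrates deriving the payload from one importable symbol:

```python
from dataclasses import dataclass, asdict

@dataclass(frozen=True)
class TaskDefinition:  # abbreviated stand-in for the real class
    task_id: str
    difficulty: str
    max_steps: int

TASKS = {
    "easy": TaskDefinition("easy", "easy", 30),
    "medium": TaskDefinition("medium", "medium", 45),
    "hard": TaskDefinition("hard", "hard", 60),
}

def tasks_payload() -> dict:
    # every task advertises a grader, matching has_grader: true in openenv.yaml
    return {"tasks": [asdict(t) | {"has_grader": True} for t in TASKS.values()]}
```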
This is the vaguest failure mode. It usually means the model's output could not be turned into valid actions: the parser in inference.py::parse_action_text rejected what the LLM produced.

Fix priority:
- Make parse_action_text maximally lenient: strip "Action: ..." prefixes, tolerate trailing periods, accept "dispatch(CALL-001, ALS-1)" function-call syntax, and regex-match common variants. Here's a tested version:

import re
def parse_action_text(text: str) -> DispatchPulseAction:
"""Lenient parser: tolerates markdown, prefixes, function call syntax."""
text = (text or "").strip()
# Strip markdown code fences
text = re.sub(r"^```\w*\n?", "", text)
text = re.sub(r"\n?```$", "", text)
text = text.strip()
# Take first non-empty line
for line in text.splitlines():
line = line.strip().strip("`").strip()
if line:
text = line
break
# Strip common prefixes
for prefix in ("Action:", "action:", "ACTION:", "Response:", "> "):
if text.startswith(prefix):
text = text[len(prefix):].strip()
# Strip trailing period / quotes
text = text.rstrip(".\"' ")
# Try function-call syntax: dispatch(CALL-001, ALS-1)
match = re.match(r"(\w+)\s*\((.*)\)$", text)
if match:
fn = match.group(1).lower()
args = [a.strip().strip("'\"") for a in match.group(2).split(",")]
text = f"{fn} " + " ".join(args)
# Now use the existing space-separated parser
parts = text.split(maxsplit=4)
# ... rest of existing logic
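The pre-parse cleanup above can be exercised in isolation. This is a self-contained sketch that reproduces only the normalization steps and stops before constructing the project's DispatchPulseAction:

```python
import re

def normalize_action_text(text: str) -> str:
    """Lenient cleanup: fences, first line, prefixes, punctuation, call syntax."""
    text = (text or "").strip()
    text = re.sub(r"^```\w*\n?", "", text)   # strip opening code fence
    text = re.sub(r"\n?```$", "", text)      # strip closing code fence
    for line in text.splitlines():           # take first non-empty line
        line = line.strip().strip("`").strip()
        if line:
            text = line
            break
    for prefix in ("Action:", "action:", "ACTION:", "Response:", "> "):
        if text.startswith(prefix):
            text = text[len(prefix):].strip()
    text = text.rstrip(".\"' ")              # trailing period / quotes
    m = re.match(r"(\w+)\s*\((.*)\)$", text) # dispatch(CALL-001, ALS-1)
    if m:
        args = [a.strip().strip("'\"") for a in m.group(2).split(",")]
        text = f"{m.group(1).lower()} " + " ".join(args)
    return text
```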
- Set temperature to 0.0 (already done in current inference.py — verify).

Round 1 is submitted and frozen. For Round 2 prep:
git checkout -b round2-experiments
# Make experimental changes here
# Never merge back to main unless we know Round 1 is done with
Tell Arun he should not touch main until the Round 2 finale is over.
If Round 1 is passing and Arun asks for upgrades, the rules:
Treat these as load-bearing: inference.py, models.py, server/, simulation.py, reward.py, grader.py.

For risky changes: make them on a branch, run the full test suite, run the validator, run inference.py with the in-process fallback, AND open a new HF Space with a different name to test the Docker build in isolation. Only merge to main if Arun explicitly says so.
.venv/bin/python tests/test_reward.py
# Expected: "All reward tests passed!"
.venv/bin/python tests/test_simulation.py
# Expected: "All simulation tests passed!"
These are NOT pytest — they're module-level __main__ scripts. Do not run pytest on them.
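Their shape is roughly this (an illustrative sketch, not the real test content):

```python
# tests/test_reward.py-style layout: plain asserts, run as a script
def test_reward_is_non_negative():
    assert max(0.0, -1.0) == 0.0  # placeholder assertion

if __name__ == "__main__":
    test_reward_is_non_negative()
    print("All reward tests passed!")
```

Running them with pytest changes collection and output semantics, which is why the expected "All ... tests passed!" lines only appear when run as plain scripts.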
# Start server in background
ENABLE_WEB_INTERFACE=true .venv/bin/uvicorn server.app:app --host 127.0.0.1 --port 8765 > /tmp/dp.log 2>&1 &
SERVER_PID=$!
sleep 4
# Exercise every endpoint
curl -sf http://127.0.0.1:8765/health # {"status":"healthy"}
curl -sf http://127.0.0.1:8765/tasks | python3 -m json.tool
curl -sf http://127.0.0.1:8765/tasks/easy | python3 -m json.tool
curl -sf -X POST http://127.0.0.1:8765/reset \
-H "Content-Type: application/json" \
-d '{"task_name":"easy","seed":42}' | python3 -m json.tool | head -20
curl -sf -X POST http://127.0.0.1:8765/step \
-H "Content-Type: application/json" \
-d '{"action":{"action_type":"wait","minutes":2,"text":"wait 2","metadata":{}}}' | python3 -m json.tool | head -20
curl -sf -X POST http://127.0.0.1:8765/grader \
-H "Content-Type: application/json" \
-d '{"task_id":"easy","seed":42}' | python3 -m json.tool
# Cleanup
kill $SERVER_PID 2>/dev/null
BASE=https://arun-sanjay-dispatchpulse.hf.space
curl -sf $BASE/health
curl -sf $BASE/tasks | python3 -m json.tool | head -30
curl -sf -X POST $BASE/reset -H "Content-Type: application/json" -d '{"task_name":"easy","seed":42}' | head -c 300
curl -sf -X POST $BASE/grader -H "Content-Type: application/json" -d '{"task_id":"easy","seed":42}' | python3 -m json.tool
./scripts/validate-submission.sh https://arun-sanjay-dispatchpulse.hf.space .
All 3 checks must pass (or skip cleanly with WARN for docker/openenv if those CLIs aren't installed locally).
DISPATCHPULSE_TASK=easy .venv/bin/python inference.py 2>&1 | grep -E '^\[(START|STEP|END)\]'
Must produce exactly one [START] line, one or more [STEP] lines, and exactly one [END] line — all format-compliant.
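A small self-check over a captured log can enforce that invariant (a sketch; it validates only the tag sequence, not the field formats):

```python
import re

def check_episode_log(lines):
    """True iff exactly one [START], >=1 [STEP], exactly one [END], in order."""
    tags = [m.group(1) for line in lines
            if (m := re.match(r"^\[(START|STEP|END)\]", line))]
    return (tags.count("START") == 1
            and tags.count("END") == 1
            and "STEP" in tags
            and tags[0] == "START"
            and tags[-1] == "END")
```

For example, run it over the capture from earlier: check_episode_log(open('/tmp/out.log').read().splitlines()).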
curl -sf https://huggingface.co/api/spaces/Arun-Sanjay/dispatchpulse \
  | python3 -c "import sys,json; d=json.load(sys.stdin); rt=d.get('runtime',{}); print('stage:', rt.get('stage'), 'sha:', (rt.get('sha') or '')[:7], 'lastModified:', d.get('lastModified'))"
stage: RUNNING is what you want. RUNNING_APP_STARTING means a rebuild is in progress.
Small, logical chunks. Example:
git add <specific files>
git commit -m "<imperative verb> <thing>: <why>"
.venv/bin/python tests/test_reward.py && .venv/bin/python tests/test_simulation.py
./scripts/validate-submission.sh https://arun-sanjay-dispatchpulse.hf.space .
Both must pass before handing off push commands.
Never run these yourself. Give him the exact commands, with PASTE_HF_TOKEN_HERE and PASTE_GH_TOKEN_HERE placeholders:
# Push to HF Space (triggers rebuild)
git push https://Arun-Sanjay:[email protected]/spaces/Arun-Sanjay/dispatchpulse main
# Push to GitHub
git push https://Arun-Sanjay:[email protected]/Arun-Sanjay/DispatchPulse.git main
Remind him where to create fresh tokens:
Remind him to revoke both tokens immediately after the push. Tokens in command lines are in shell history and should be treated as burned.
If Arun accidentally pastes a token in chat: stop, warn him, tell him to invalidate it, refuse to use it.
After Arun confirms the pushes landed:
# Poll the HF Space until stage=RUNNING with the new commit sha
for i in 1 2 3 4 5; do
sleep 45
curl -sf https://huggingface.co/api/spaces/Arun-Sanjay/dispatchpulse \
  | python3 -c "import sys,json; d=json.load(sys.stdin); rt=d.get('runtime',{}); print(f'poll $i: stage={rt.get(\"stage\")} sha={rt.get(\"sha\",\"\")[:7]}')"
STATUS=$(curl -s -o /dev/null -w "%{http_code}" https://arun-sanjay-dispatchpulse.hf.space/tasks)
if [ "$STATUS" = "200" ]; then
echo "LIVE"; break
fi
done
Give him the two URLs to paste:
- https://github.com/Arun-Sanjay/DispatchPulse
- https://huggingface.co/spaces/Arun-Sanjay/dispatchpulse

Never do:
- Hand-edit uv.lock, inference.py, openenv.yaml, or Dockerfile casually. All are load-bearing.
- Edit tasks/*.yaml. The scenarios are locked in for this submission.
- Rewrite the tests — they are __main__-style scripts.
- git push --force unless explicitly justified (the GitHub repo was force-pushed once to overwrite an auto-generated README — it should never happen again).

Gotchas:
- create_app vs create_fastapi_app: create_app is the right choice — it serves the Gradio UI at / when ENABLE_WEB_INTERFACE=true. create_fastapi_app is API-only and causes the "details not found" 404 at the root URL.
- The project builds on both the Environment base class + EnvClient base class. See commit 64d56f9.
- Missing uv.lock: the Phase 1 validator fails with "Missing uv.lock - run 'uv lock' to generate it". Always run .venv/bin/python -m pip install uv && .venv/bin/uv lock if uv.lock is missing.
- Set the ENABLE_WEB_INTERFACE=true env var when testing locally. Without it, create_app drops the Gradio routes.
- You'll be tempted to run the tests with pytest. Don't. The tests use if __name__ == "__main__": blocks and run as plain Python scripts.

If Arun has just opened a new chat and wants you to pick up where the previous one left off, tell him to paste this into the new chat:
I'm working on DispatchPulse at /Users/arunsanjay/Documents/Projects/DispatchPulse for the Meta PyTorch OpenEnv Hackathon India 2026. Round 1 submission is already in flight.

Please read these two files before doing anything:
- CLAUDE.md at the project root — full project context
- .claude/skills/dispatchpulse/SKILL.md — operational playbook

Then check the current state with `git log --oneline | head -5`, `git status`, and `curl -sf https://arun-sanjay-dispatchpulse.hf.space/health`.

Here's the current situation: [briefly describe — e.g. "Phase 2 email came back with X failure" or "Phase 2 passed, I want to prep Round 2" or "I want to add LLM baseline scores to README"]
What do you recommend as the next step?
That prompt plus the two files will get a new Claude session fully up to speed in under 30 seconds of reading.
End of SKILL.md. Load this alongside CLAUDE.md at the start of every session working on DispatchPulse.