Use this skill to diagnose strategy behavior from completed backtest artifacts in this repo.

Preferred inputs:

one or more run IDs under artifacts/runs/
the strategy names or sibling variants to compare
optional calendar windows to slice

Preferred artifact sources:

artifacts/runs/<run_id>/report/report.json
artifacts/runs/<run_id>/report/summary.json
artifacts/runs/<run_id>/logs/trace.jsonl
artifacts/runs/<run_id>/logs/default.jsonl

Use This Workflow

Follow this order. Do not jump to strategy changes before the diagnosis steps are done.

1. Separate the question

Answer these independently:

What market condition is the strategy implicitly designed for?
Where does it actually make money?
Where does it lose money?
Are losses caused by entry logic, exit logic, regime gating, or sizing/execution?

If a user asks a vague question like “is this strategy good,” translate it into those four questions first.

Use this skill to diagnose strategy behavior from completed backtest artifacts in this repo.

Preferred inputs:

one or more run IDs under artifacts/runs/
the strategy names or sibling variants to compare
optional calendar windows to slice

Preferred artifact sources:

artifacts/runs/<run_id>/report/report.json
artifacts/runs/<run_id>/report/summary.json
artifacts/runs/<run_id>/logs/trace.jsonl
artifacts/runs/<run_id>/logs/default.jsonl

Use This Workflow

Follow this order. Do not jump to strategy changes before the diagnosis steps are done.

1. Separate the question

Answer these independently:

What market condition is the strategy implicitly designed for?
Where does it actually make money?
Where does it lose money?
Are losses caused by entry logic, exit logic, regime gating, or sizing/execution?

If a user asks a vague question like “is this strategy good,” translate it into those four questions first.

Strategy Diagnosis

Use This Workflow

1. Separate the question

Strategy Diagnosis

Use This Workflow

1. Separate the question

2. Prefer full-run slicing over many sub-window reruns

3. Label regimes before interpreting metrics

4. Compute regime metrics

5. Diagnose behavior, not just aggregate metrics

6. Run post-decision drift

6.5. Check trade clustering before cooldown ideas

7. Compare counterfactual variants

8. Build the regime matrix

Standard Output

Repo Notes

Scripts In This Skill

References

Llm Trading Agent Security

Energy Procurement

Council

Carrier Relationship Management

Market Research

Market Research

Strategy Diagnosis

Use This Workflow

1. Separate the question

Strategy Diagnosis

Use This Workflow

1. Separate the question

2. Prefer full-run slicing over many sub-window reruns

3. Label regimes before interpreting metrics

4. Compute regime metrics

5. Diagnose behavior, not just aggregate metrics

6. Run post-decision drift

6.5. Check trade clustering before cooldown ideas

7. Compare counterfactual variants

8. Build the regime matrix

9. Only then recommend changes

Standard Output

Repo Notes

Scripts In This Skill

References

Llm Trading Agent Security

Energy Procurement

Council

Carrier Relationship Management

Market Research

Market Research