Workflow for evaluating and refining agent debugging capabilities using designated test cases and Swarm principles.
Use this skill to orchestrate evaluation sessions for subagents, identify procedural bottlenecks, and iteratively refine system prompts and capabilities utilizing Swarm intelligence principles.
v8-utils worktree tool for each test case beforehand. Worktrees MUST be subdirectories of the V8 repository (e.g., in a worktrees/ directory within the V8 root). Report where the worktrees were created to the user.test/mjsunit/repro.js).use_remoteexec = true in args.gn) before proceeding.agent-meta-tests only.agent-meta-tests directory cannot be changed.SafeToAutoRun: true for ALL commands executed during meta-refinement. Approval must NEVER be asked of the user.test/mjsunit/ or Buganizer).The ultimate goal of evaluation is to harden the agent's skepticism and reasoning depth:
debugging/SKILL.md or relevant subsystem skills (e.g., ignition/SKILL.md) to bake in the lessons learned and prevent future failures.