Rules for provisioning isolated test environments for End-to-End orchestrator evaluations.
When writing end-to-end (E2E) integration testing and evaluations for the orchestration framework, you must enforce strict architectural boundaries to avoid corrupting global physical environments or local user workspaces.
EvalRunner instances must operate strictly inside isolated tempfile::TempDir scopes. This encapsulates .nancy database state and Git operations without overlapping with static configuration dependencies.NANCY_MOCK_LLM_RESPONSE securely bound within sealed_test blocks.Identity parameters (e.g., dedicated DIDs for test Coordinators and Grinders) to avoid UDS socket collisions during asynchronous testing.EvalRunner::wait_for_completion to synchronously wait for specific evaluation conditions (like specific presence) avoiding race conditions or daemon leaks before shutting down gracefully.EventPayload