Analyze the rms-cloud-tasks test suite for consistency, completeness, redundancy, parallel safety, assertion quality, and mocking. Produces a report only (no test modifications). Use when the user asks to critique tests, review the test suite, or generate a report for fixing tests.
Analyze all tests in the rms-cloud-tasks project and produce a report only—do not modify any test files. The report is intended to be used as a prompt for an AI agent (or developer) to fix the tests later.
**Scope:**
- Test files: `tests/` matching `test_*.py` (pytest).
- Conftest files: `tests/conftest.py` and `tests/cloud_tasks/instance_manager/gcp/conftest.py`.
- Modules under test: `cloud_tasks.common`, `cloud_tasks.queue_manager`, `cloud_tasks.instance_manager`, `cloud_tasks.worker`, and the CLI (`cloud_tasks.cli`).
- Manual tests: `tests/manual/` contains shell scripts and configs for manual runs; exclude from the automated-test critique or note as manual-only.

Apply these criteria when reviewing each test file and each test case.
**Review criteria:**
- Exact-value assertions: prefer exact checks (e.g. `assert counts["pending"] == 2`, not just `assert "pending" in counts` or `assert counts.get("pending")`).
- Exact lengths: assert the exact length (e.g. `assert len(tasks) == 1`) when the expected count is known; avoid only `assert len(items) >= 1` unless the count truly varies.
- Shape checks: verify expected keys and fields on returned dicts/objects (e.g. `task_id`, `data`, `ack_id`).
- Naming: use descriptive test names (e.g. `test_<behavior>_<condition>_<expected>` or `test_<function>_<when>_<result>`).
- Failure context: include context in test names and in asserts (this improves test failure clarity and aligns with project testing standards).
- Shared state: verify that `_thread_local` and `_pricing_cache_lock` actually exist in the repo before referencing them. Treat those symbols only as potential signposts for areas that might leak state (e.g., thread-local storage or shared locks), not as facts about the codebase. When reviewing files such as conftest and instance_manager, (1) search for those symbols and document findings if present; (2) if not found, report that they were not observed rather than implying they must exist. Also check tests that mutate fixture-derived instances for possible state leaks.
- Parallel safety: check compatibility with `pytest -n auto` (if used).
- Time determinism: tests involving `time.sleep`, `datetime.now()`, or expiration logic should mock or freeze time for determinism; note time-sensitive tests without freezing.
- Environment: tests that depend on `os.environ` or `sys.argv` should patch them; note tests that would fail with a different env or argv.
- Async mocking: async provider calls (e.g. `send_message`, `receive_tasks`) should be mocked with `AsyncMock` where the test is async; note misuse.
- `@pytest.mark.parametrize`: similar test cases (e.g. multiple invalid configs, multiple providers) should be parameterized instead of copy-pasted; note repeated test bodies that differ only in input.
- Parameterized fixtures: consider `@pytest.fixture(params=["aws", "gcp", "azure"])` for provider; note other places where parameterized fixtures would reduce duplication.
- Async fixtures: async fixtures should use `@pytest_asyncio.fixture`; note sync fixtures in async test files or misuse of `event_loop` scope.
- Worker isolation: review mocks of `_wait_for_shutdown`, `create_task`, or process spawning for isolation and whether they are sufficient.
- Exception messages: use `pytest.raises(SomeError) as exc_info` and assert on `str(exc_info.value)` or the exception's message attribute.
  Note tests that only check the exception type.
- Fixture scope: prefer the narrowest scope that works (function > class > module > session). The GCP conftest uses module/package scope for speed; note whether any fixture scope causes isolation issues or cross-test pollution.
- Shared GCP manager: the `copy_gcp_instance_manager_for_test` helper exists to avoid thread-local serialization; note tests that mutate shared manager instances without copying.
- Randomness: tests that use `uuid4()` or `random` for assertions without seeding are non-deterministic; note and suggest seeding or fixed values where assertion stability matters.
- Assertion messages: add messages to non-obvious asserts (e.g. `assert x == y, f"Expected {x} to equal {y}"`); note assertions that would be hard to debug on failure.
- Logic in tests: tests should avoid conditionals (`if`/`elif`/`else`), ternary expressions (`x if condition else y`), other branching such as `match`/`case` (pattern matching added in Python 3.10; Python historically lacked a traditional switch/case), loops, or complex logic; note tests that do and suggest splitting or parameterizing.
- Coverage: measure coverage by running the entire suite (`pytest tests --cov=src/cloud_tasks --cov-report=term-missing`), not a subset. `.coveragerc` omits `tests/*`, `azure.py`, and `aws.py`; note whether the omit list is appropriate and whether any critical code is excluded from coverage expectations.

Produce a single markdown report with the following structure. Do not edit any test files; only write the report.
# Test Suite Critique Report
**Generated:** [date]
**Scope:** tests/ (pytest); modules: common, queue_manager, instance_manager, worker, cli
## Executive summary
- Overall assessment (strengths, main gaps).
- **Coverage:** Target is at least 80%, measured by running the entire test suite. Note whether 80% is met and whether the measurement was full-suite.
- **Exception messages:** When testing exceptions with defined messages, tests must assert on message contents (e.g. `pytest.raises(...) as exc_info`, then `str(exc_info.value)`). Note violations.
- High-priority fixes vs. nice-to-have.
## 1. Return values and assertions
- Tests that only check existence or non-None; suggest exact value or type/format checks.
- Lists asserted with `>= N` where exact length is knowable; suggest exact length.
- Missing shape or key checks for dicts/objects.
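A minimal sketch of the contrast this section looks for, using a hypothetical `get_task_counts` helper (not from the project):

```python
def get_task_counts():
    # Hypothetical queue snapshot used only for illustration.
    return {"pending": 2, "running": 1, "completed": 0}

def test_task_counts_exact():
    counts = get_task_counts()
    # Weak: assert "pending" in counts   (only checks existence)
    # Exact value when the expected count is known:
    assert counts["pending"] == 2
    # Shape check: the full key set, not just one key.
    assert set(counts) == {"pending", "running", "completed"}
```

The same pattern applies to lists: prefer `assert len(tasks) == 1` over `>= 1` whenever the fixture makes the exact count knowable.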
## 2. Success and failure conditions
- Per area (common, queue_manager, instance_manager, worker, cli): table of "behavior | tested? | notes".
- Missing: validation errors, missing config, provider/not-found errors, edge cases.
## 3. Consistency
- Naming/structure inconsistencies with examples.
- Fixture usage and duplication across files.
## 4. Completeness
- Coverage map (what's tested, what's missing per module).
- Doc/spec gaps.
## 5. Redundancy
- Duplicate or overlapping tests with file:test references.
## 6. Parallel execution and isolation
- Global state, order dependence, GCP fixture scope and deepcopy usage, shared resources.
## 7. Mocking and dependency isolation
- Real external calls, time-sensitive tests without freezing, env/argv dependencies, AsyncMock vs Mock.
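The `AsyncMock` vs `Mock` distinction can be sketched as follows; `drain_queue` and `queue.ack` are hypothetical stand-ins, though `receive_tasks` and `ack_id` mirror names used elsewhere in this spec:

```python
import asyncio
from unittest.mock import AsyncMock, Mock

async def drain_queue(queue):
    # Hypothetical consumer: pulls one batch and acks each task.
    tasks = await queue.receive_tasks(max_count=10)
    for task in tasks:
        await queue.ack(task["ack_id"])
    return len(tasks)

def test_drain_queue_with_asyncmock():
    queue = Mock()
    # Async methods must be AsyncMock so "await" works; a plain Mock
    # attribute would return a non-awaitable and fail confusingly.
    queue.receive_tasks = AsyncMock(
        return_value=[{"ack_id": "a1"}, {"ack_id": "a2"}]
    )
    queue.ack = AsyncMock()

    drained = asyncio.run(drain_queue(queue))

    assert drained == 2
    queue.receive_tasks.assert_awaited_once_with(max_count=10)
    assert queue.ack.await_count == 2
```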
## 8. Config and validation testing
- Pydantic/config validation coverage, exception message assertions, load_config edge cases, secrets in tests.
## 9. Parameterization
- Tests that could be parameterized, missing boundary value tests.
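A hedged sketch of boundary-value parameterization; `clamp_simultaneous_tasks` is a hypothetical helper, not project code:

```python
import pytest

def clamp_simultaneous_tasks(n: int) -> int:
    # Hypothetical helper used only to illustrate boundary testing.
    return max(1, min(n, 64))

@pytest.mark.parametrize(
    "requested, expected",
    [(0, 1), (1, 1), (32, 32), (64, 64), (65, 64)],  # below-min, min, mid, max, above-max
)
def test_clamp_boundaries(requested, expected):
    assert clamp_simultaneous_tasks(requested) == expected
```

One parameterized test replaces five near-identical copy-pasted bodies that differ only in input.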
## 10. Async and concurrency
- Async fixture usage, timeouts, worker multiprocessing/async isolation.
## 11. Error handling
- Missing error body or message verification.
- Exception tests that only assert type; require asserting message contents where defined.
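The required pattern can be illustrated with a hypothetical `load_config` stand-in (not the project's real loader):

```python
import pytest

def load_config(provider: str) -> dict:
    # Hypothetical stand-in for a config loader with a defined error message.
    if provider not in ("aws", "gcp", "azure"):
        raise ValueError(f"Unknown provider: {provider}")
    return {"provider": provider}

def test_unknown_provider_raises_with_message():
    # Insufficient: pytest.raises(ValueError) alone only checks the type.
    # Required: capture exc_info and assert on the message content.
    with pytest.raises(ValueError) as exc_info:
        load_config("dne")
    assert "Unknown provider: dne" in str(exc_info.value)
```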
## 12. State and workflow
- Task DB transitions, worker lifecycle, CLI subcommands, idempotency.
## 13. Test data and fixtures
- Unrealistic data, cleanup issues, fixture scope, GCP deepcopy usage.
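A sketch of the copy-before-mutate pattern; `FakeManager` and `copy_manager_for_test` are illustrative stand-ins for the project's reported `copy_gcp_instance_manager_for_test` helper:

```python
import copy

class FakeManager:
    # Stand-in for a broadly-scoped fixture instance shared across tests.
    def __init__(self):
        self.config = {"zone": "us-central1-a"}

SHARED = FakeManager()

def copy_manager_for_test(mgr):
    # Per-test deepcopy so mutations cannot leak into other tests.
    return copy.deepcopy(mgr)

def test_mutation_does_not_leak():
    mgr = copy_manager_for_test(SHARED)
    mgr.config["zone"] = "europe-west1-b"
    # The shared instance is untouched.
    assert SHARED.config["zone"] == "us-central1-a"
```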
## 14. Flakiness indicators
- Time-based assertions, order dependence, external dependencies, unseeded randomness.
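Two hedged sketches of determinism fixes, using hypothetical helpers (`make_task_id`, `is_expired`) that accept injected randomness and time rather than reaching for globals:

```python
import random
from datetime import datetime, timezone

def make_task_id(rng: random.Random) -> str:
    # Hypothetical ID factory; injecting the RNG keeps assertions stable.
    return f"task-{rng.randrange(10**6):06d}"

def is_expired(created_at: datetime, ttl_seconds: int, now: datetime) -> bool:
    # Passing "now" explicitly avoids flaky datetime.now() comparisons.
    return (now - created_at).total_seconds() > ttl_seconds

def test_task_id_is_deterministic_with_seed():
    # Same seed -> same sequence -> stable assertion.
    assert make_task_id(random.Random(42)) == make_task_id(random.Random(42))

def test_expiration_with_frozen_now():
    created = datetime(2024, 1, 1, tzinfo=timezone.utc)
    frozen_now = datetime(2024, 1, 1, 0, 1, 1, tzinfo=timezone.utc)  # 61s later
    assert is_expired(created, ttl_seconds=60, now=frozen_now)
    assert not is_expired(created, ttl_seconds=120, now=frozen_now)
```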
## 15. Regression and documentation
- Missing bug references, spec/test alignment, manual-only coverage.
## 16. Other
- Unclear tests, slow tests, missing assertion messages, multi-responsibility tests, AAA violations, logic in tests.
## 17. Code coverage
- Target 80%; full-suite measurement. List modules below 80% or with significant uncovered lines. Note .coveragerc omit list.
## Prompt for an AI agent to fix tests
This section is a **reusable prompt template** to be filled with the report output. Use the placeholders below; include either the full report or a summarized version as specified.
**Template (fill placeholders):**
```text
Apply the following test-suite fixes. Use the critique report as context.
<REPORT_SUMMARY>
Paste the full report or a concise summary (sections 1–17) here.
</REPORT_SUMMARY>
<FAILURES>
List specific failures, file names, test names, and line references from the report.
</FAILURES>
<FILES_TO_EDIT>
List the test/conftest files to modify (paths under tests/).
</FILES_TO_EDIT>
Constraints:
- **Coverage:** Run the full test suite for coverage; require ≥80% for code under test; cover almost all non-exception lines.
- **Exception messages:** For tests that expect exceptions with defined messages, use `pytest.raises(...) as exc_info` and assert message content: `assert "expected substring" in str(exc_info.value)` (or equivalent). Do not only assert that an exception was raised.
- **Production code:** Do not modify production code. Fix only tests and conftest files.
- **Behavior:** Preserve existing passing behavior; only add or change assertions and test structure as indicated by the report.
```

**Example filled prompt:**
Apply the following test-suite fixes.
<REPORT_SUMMARY>
[Section 2] test_config.py: missing failure-path tests for invalid provider.
[Section 4] test_worker_init.py: no test for num_simultaneous_tasks boundary.
[Section 8] test_config.py: exception tests do not assert message content.
</REPORT_SUMMARY>
<FAILURES>
- tests/cloud_tasks/common/test_config.py: add tests for invalid provider; in test_runconfig_raises_*, use exc_info and assert str(exc_info.value).
- tests/cloud_tasks/worker/test_worker_init.py: add parameterized test for num_simultaneous_tasks at min/max.
</FAILURES>
<FILES_TO_EDIT>
tests/cloud_tasks/common/test_config.py
tests/cloud_tasks/worker/test_worker_init.py
</FILES_TO_EDIT>
Constraints: Run full test suite for coverage (≥80%). Use pytest.raises(...) as exc_info and assert str(exc_info.value) for exception message checks. Do not modify production code. Preserve existing passing behavior.
Review `tests/` (`test_*.py`) and conftest files, focusing on assertion quality (`assert`, `pytest.raises`, response/return checks, and fixtures).