Use this skill whenever mutation testing is relevant. Triggers: "mutation testing", "mutation score", "mutant", "mutants", "PIT", "pitest", "cargo-mutants", "Stryker", "stryker-mutator", "mutmut", "mutation operators", "killed mutant", "survived mutant", "equivalent mutant", "test quality", "test effectiveness", "test the tests", "mutation analysis", "are my tests good enough", "is my coverage meaningful", "do my tests actually catch bugs". Also trigger when verifying test thoroughness, setting up CI quality gates for test effectiveness, or discussing why 100% code coverage is insufficient. Covers theory, all major frameworks, result interpretation, and the critical insight that survived mutants almost always mean "improve your tests" not "fix your code". Always consult before doing mutation testing work — it prevents the most common agent mistake of treating survived mutants like regular test failures.
Mutation testing measures test suite quality by injecting small faults (mutations) into production code and checking whether the existing tests catch them. If unit tests and integration tests test your code, mutation tests test your tests.
This is a supplemental testing technique — it does not replace unit tests, integration tests, or any other form of testing. It validates that existing tests are effective.
Read this section carefully. It is the most important part of this skill.
When a mutation test reports a "survived mutant" (a fault that your tests didn't catch), the correct response is almost always to improve the test suite, not to change the production code. The production code is presumably correct — the mutation tool deliberately broke it, and your tests failed to notice. The fix is a better test.
This is the opposite of normal test failure semantics, where a failing test usually means the code is wrong. Agents that don't internalize this distinction will waste enormous effort "fixing" production code that was never broken.
The rare exception: Sometimes a survived mutant reveals that the production code has ambiguous semantics that make it genuinely hard to test. For example, a Rust function returning `Option<String>` where `None` means invalid input, `Some("")` means valid input with no result, and `Some("value")` means valid input with a result. The mutation tool replaces the body with `None`, and tests don't catch it because the distinction between "invalid" and "empty valid" isn't observable. The fix here is to refactor the code — e.g., to `Result<Option<String>, Error>` — making the semantics explicit and testable. But this is the exception, not the rule. When in doubt, improve the tests first.
The mutation score is killed / (total − equivalent) × 100%. For a test to kill a mutant, three conditions must hold (the RIP model):
Reachability: the test must execute the mutated code.
Infection: the mutation must corrupt the program state at that point.
Propagation: the corrupted state must reach an observable output that an assertion checks.
This is precisely why 100% code coverage is insufficient — coverage guarantees Reachability but says nothing about Infection or Propagation. A test that calls `calculator.add(1, 2)` without asserting the result achieves full line coverage of the `add` method but kills zero mutants.
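The coverage-versus-kill distinction is easy to see in miniature. A runnable Python sketch, where the `Calculator` classes and the hand-applied mutant are illustrative assumptions rather than output from a real tool:

```python
class Calculator:
    def add(self, a, b):
        return a + b              # original, correct code

class MutatedCalculator:
    def add(self, a, b):
        return a - b              # AOR mutant: + replaced with -

def coverage_only_test(calc):
    calc.add(1, 2)                # runs the line (Reachability) but asserts nothing

def asserting_test(calc):
    assert calc.add(1, 2) == 3    # the infected state propagates to an assertion

coverage_only_test(MutatedCalculator())   # passes silently: the mutant survives
try:
    asserting_test(MutatedCalculator())
    survived = True
except AssertionError:
    survived = False
print("mutant survived" if survived else "mutant killed")   # mutant killed
```

Both tests give the `add` line 100% coverage; only the asserting test detects the mutation.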
These categories apply across all languages and frameworks:
Arithmetic operator replacement (AOR): +↔-, *↔/, %→*
Relational operator replacement (ROR): ==↔!=, <↔>=, >↔<=
Conditional boundary mutations: <→<=, >=→> (catches off-by-one errors)
Boolean/logical operator replacement: &&↔||, true↔false, remove !
Return value mutations: Replace returns with defaults (null, "", 0, false, empty collections)
Void method/function call removal: Delete calls to side-effecting functions
Statement deletion: Remove entire blocks or statements
Increment/decrement mutations: i++→i--, ++i→--i
Constant mutations: Change literal values (0→1, "foo"→"")
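A toy harness shows several of these operators in action and computes a score (killed / total, assuming no equivalent mutants). The `discount` function and its hand-written mutants are assumptions for illustration — real tools generate mutants automatically:

```python
def discount(price):
    if price >= 100:
        return price * 0.9
    return price

# Hand-written mutants applying the operator categories above
mutants = {
    "conditional boundary: >= -> >": lambda p: p * 0.9 if p > 100 else p,
    "AOR: * -> /":                   lambda p: p / 0.9 if p >= 100 else p,
    "return value: price -> 0":      lambda p: p * 0.9 if p >= 100 else 0,
}

def suite(f):
    assert f(50) == 50       # below the threshold: no discount
    assert f(100) == 90.0    # exactly at the threshold: discount applies
    assert f(200) == 180.0   # above the threshold

suite(discount)              # sanity check: the suite passes on the original

killed = 0
for name, mutant in mutants.items():
    try:
        suite(mutant)
        print(f"SURVIVED: {name}")
    except AssertionError:
        killed += 1

score = killed / len(mutants) * 100
print(f"{killed}/{len(mutants)} mutants killed, score {score:.0f}%")
```

Note that the `f(100)` case is what kills the boundary mutant; without it, that mutant would survive.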
After reading this file, consult the appropriate reference for the project's language:
| Language | Framework | Reference file | When to read |
|---|---|---|---|
| Java / JVM | PIT (pitest) | references/pit-java.md | Any .java, .kt, .scala on JVM |
| Rust | cargo-mutants | references/cargo-mutants-rust.md | Any .rs files, Cargo projects |
| JS / TS | StrykerJS | references/stryker-js-ts.md | Any .js, .ts, .jsx, .tsx |
| C# / .NET | Stryker.NET | references/stryker-dotnet.md | Any .cs, .NET projects |
| Scala | Stryker4s | references/stryker-scala.md | Scala projects (sbt) |
| Python | mutmut | references/mutmut-python.md | Any .py files |
If the project uses a framework not listed here, the principles in this file still apply. The user may need to identify or configure a mutation testing tool manually.
When reviewing mutation testing output, classify each survived mutant:
The mutant exposes a genuine gap. Examples:
A conditional boundary mutation (`>=` → `>`) survives because no test checks the exact boundary.
A return value mutation (`return result` → `return null`) survives because no test asserts on the return value.
Action: Write a targeted test that specifically covers the mutated behavior.
When writing this test, name it descriptively (e.g., test_boundary_at_exactly_18
not test_mutation_1) and add a comment explaining what gap it fills.
Technique for comparison operator survivors (ROR): When <, <=, ==, >,
or >= mutations survive, the fix is systematic boundary testing:
For a comparison like x < limit, test x = limit - 1, x = limit, and x = limit + 1; these three inputs distinguish < from == and >.
If boundary tests are awkward to write, the code representation may need restructuring.
A range comparison like x < 1 that really means "x is zero" should be rewritten as
x == 0 — exact comparisons are inherently more testable. Also consider testing internal
functions directly with boundary inputs rather than only through the public API.
The mutation doesn't change observable behavior. Examples:
Replacing x * 1 with x * -1 when x is always 0.
Action: Exclude via tool configuration or accept the score impact. Do not write meaningless tests just to kill equivalent mutants.
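As a sketch of genuine equivalence (function names are illustrative), consider this classic pair: the versions agree on every input, so no test can kill the mutant.

```python
def absolute(x):
    return x if x > 0 else -x     # original

def mutant_absolute(x):
    return x if x >= 0 else -x    # ROR mutant: > replaced with >=

# The two can only differ at x == 0, where both return 0 (since -0 == 0).
assert all(absolute(x) == mutant_absolute(x) for x in range(-1000, 1001))
print("no input distinguishes the mutant: it is equivalent")
```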
Important: "equivalent mutant" should be the last conclusion, not the first. Before accepting equivalence, work through this progression: first try to construct an input that would distinguish the mutant; then check whether the difference is observable through any public behavior; then consider whether a small refactor would make it observable; only if all of these fail, conclude equivalence.
Treating survivors as equivalent too quickly is the most common agent mistake in mutation testing interpretation. It short-circuits the learning that mutation testing is designed to produce.
The mutant is in boilerplate that isn't worth testing exhaustively. Examples:
toString(), hashCode(), equals() (Java).
Action: Add exclusion rules to the mutation tool's configuration. Every framework supports method-level, file-level, or pattern-based exclusions.
Rare but valuable. The mutant survives because the code's design makes the behavior genuinely ambiguous or untestable. Signals include: several distinct meanings collapsed into a single return value (as in the Option<String> example above), behavior observable only through private state, or tests that would need to duplicate the implementation to check anything.
Action: Refactor the production code to make the distinct behaviors separately observable, then write tests for each.
First-time mutation scores of 30–50% are normal even with high line coverage. Do not panic. Adopt a progressive approach: record the current score as a baseline, kill the highest-value survivors first, and raise the bar gradually.
In CI, set the tool's threshold to the current score (the ratchet) so the build fails only when the score drops below it. Raise the ratchet by 5 points each quarter.
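As one concrete form of this gate, StrykerJS expresses the failure threshold through its `thresholds.break` setting; the values and `mutate` pattern below are assumptions to adapt, not recommendations:

```js
// stryker.config.mjs — sketch; set break to the project's current ratchet
export default {
  mutate: ['src/**/*.ts'],
  incremental: true,   // only re-test mutants affected by recent changes
  thresholds: {
    high: 80,          // reporting colors only
    low: 60,
    break: 45,         // fail the build when the score drops below this
  },
};
```

PIT, Stryker.NET, and the other frameworks have equivalent threshold settings; see the reference files.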
Mutation testing multiplies test suite runtime by the number of mutants. Five strategies make this tractable:
Incremental analysis: Only re-test mutants where either the code or its killing test has changed. All major frameworks support this (see reference files).
Changed-code-only: Restrict mutations to files modified in the current PR/commit. This is the recommended approach for CI on pull requests.
Parallelism: Distribute mutants across CPU cores or CI machines. Every framework supports parallel execution; some support cross-machine sharding.
Coverage-guided filtering: Only generate mutants on lines that have test coverage. Mutating uncovered lines is pointless — you already know there's no test.
Scope restriction: Exclude non-critical code (DTOs, generated code, logging), use reduced operator sets for initial runs, and set appropriate timeouts.
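For example, cargo-mutants combines the changed-code-only and parallelism strategies directly (`--in-diff` and `--jobs` are real flags; the base branch and job count are assumptions for this sketch):

```shell
# PR pipeline: mutate only code touched by this branch, across 4 cores
git diff origin/main...HEAD > pr.diff
cargo mutants --in-diff pr.diff --jobs 4
```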
Recommended CI pattern: diff-scoped mutation testing on every pull request, plus a scheduled full run (nightly or weekly) to track the overall score.
Follow this sequence:
Determine the project language and check if a mutation testing tool is already configured.
Look for configuration files: pom.xml or build.gradle (PIT plugin), .cargo/mutants.toml,
stryker.config.mjs or stryker.config.json, setup.cfg or pyproject.toml (mutmut).
Read the appropriate reference file.
Before mutation testing, confirm the existing test suite passes cleanly. Mutation testing assumes a green test suite — if tests are already failing, fix those first.
Start with a focused scope — a single module or the files changed in a PR. Running mutation testing on an entire large codebase the first time will be slow and overwhelming.
Review each survived mutant using the decision tree above. Categorize before acting. Do NOT reflexively modify production code.
For each valuable survivor, write a targeted test. Be specific about what the test covers and why it was missing.
After improving tests, re-run mutation testing to confirm the new tests kill the previously surviving mutants. Update thresholds if appropriate.
Modifying production code to make mutants "pass": The production code was correct. The mutation tool broke it on purpose. Improve the tests instead.
Treating mutation score like coverage: Coverage says "this line ran." Mutation score says "changing this line was detected." They measure different things. A module can have 100% coverage and 30% mutation score.
Trying to kill every mutant: Some mutants are equivalent (unkillable) and some are in code not worth testing exhaustively. Exclude them. A pragmatic 80% score on important code beats a forced 95% achieved by writing meaningless tests.
Running full mutation suites on every commit: This is too slow for most projects. Use incremental/diff-based analysis for PRs, full runs on a schedule.
Ignoring timeout mutants: Timeouts count as killed. If you see many timeouts, consider increasing the timeout multiplier — some tests are legitimately slow when code is mutated (e.g., a loop bound change causing 10x more iterations).
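The loop-bound case can be sketched in Python with an iteration budget standing in for the tool's wall-clock timeout (the budget mechanism and function names are illustrations, not how real tools detect timeouts):

```python
def count_up(n):
    total, i = 0, 0
    while i < n:
        total += i
        i += 1                    # original increment
    return total

def mutant_count_up(n, budget=1_000_000):
    # budget stands in for the mutation tool's wall-clock timeout
    total, i = 0, 0
    while i < n:
        total += i
        i -= 1                    # mutant: += replaced with -=, so the loop never terminates
        budget -= 1
        if budget == 0:
            raise TimeoutError("mutant exceeded the time budget")
    return total

assert count_up(10) == 45         # original terminates normally
try:
    mutant_count_up(10)
    outcome = "survived"
except TimeoutError:
    outcome = "killed (timeout)"
print(outcome)                    # killed (timeout)
```

The runaway loop never produces a wrong answer to assert on; the timeout itself is the detection, which is why timeouts count as killed.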
Conflating mutation testing with fuzz testing: Fuzz testing generates random inputs. Mutation testing generates deliberate code changes. They are complementary but fundamentally different techniques.
Mutation testing is most powerful when combined with: