Evaluates tests added in a PR for coverage, quality, edge cases, and test type appropriateness. Checks if tests cover the fix, finds gaps, and recommends lighter test types when possible. Prefer unit tests over device tests over UI tests. Triggers on: 'evaluate tests in PR', 'review test quality', 'are these tests good enough', 'check test coverage', 'is this test adequate', 'assess test coverage for PR'.
Evaluates the quality, coverage, and appropriateness of tests added in a PR. Produces a structured report with actionable findings.
```shell
# Auto-detect PR and base branch
pwsh .github/skills/evaluate-pr-tests/scripts/Gather-TestContext.ps1

# With explicit base branch
pwsh .github/skills/evaluate-pr-tests/scripts/Gather-TestContext.ps1 -BaseBranch "origin/main"
```
Run the script to get file categorization, convention checks, and anti-pattern detection:

```shell
pwsh .github/skills/evaluate-pr-tests/scripts/Gather-TestContext.ps1
```

This produces a report at `CustomAgentLogsTmp/TestEvaluation/context.md`. Then:

1. Read the fix files to understand what the fix changes and why.
2. Read each test file and evaluate it against all the criteria below. For each criterion, provide a verdict (✅ Pass, ⚠️ Concern, ❌ Fail) with an explanation.
3. Output a structured evaluation report (see Output Format below).
### 1. Fix Coverage

Question: Does the test exercise the actual code paths changed by the fix?
How to check:
Red flags:
Example — Good:

```csharp
// Fix: CollectionView.SelectedItem setter now clears selection when set to null
// Test: Sets SelectedItem to null and verifies selection is cleared
App.Tap("SelectItem");
App.Tap("ClearSelection"); // Sets SelectedItem = null
var text = App.FindElement("SelectionStatus").GetText();
Assert.That(text, Is.EqualTo("None")); // Directly tests the fix
```
Example — Bad:

```csharp
// Fix: CollectionView.SelectedItem setter
// Test: Just checks CollectionView renders (doesn't test selection clearing)
App.WaitForElement("MyCollectionView");
Assert.That(true); // Proves nothing about the fix
```
### 2. Edge Cases & Gaps

Question: Does the test cover boundary conditions, or only the happy path?
Check for these common gaps:
| Gap Type | What to Look For |
|---|---|
| Null/empty | Does the fix handle null? Is it tested? |
| Boundary values | Min, max, zero, negative, very large |
| Repeated actions | Does calling the action twice cause issues? |
| Platform-specific | Does the bug only occur on certain platforms? |
| Async/timing | Does the fix involve async code? Race conditions? |
| State transitions | Does the test cover before→after state changes? |
| Error paths | What happens when the operation fails? |
| Combination effects | Does the fix interact with other properties/features? |
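Several of these gaps can often be checked in one short test. A hypothetical NUnit/Appium-style sketch (the AutomationIds are made up, following the earlier example) covering the null path and a repeated action:

```csharp
[Test]
public void ClearingSelectionTwiceIsSafe()
{
    App.WaitForElement("SelectItem");
    App.Tap("SelectItem");
    App.Tap("ClearSelection");  // null path: SelectedItem = null
    App.Tap("ClearSelection");  // repeated action: clearing again must not throw
    var text = App.FindElement("SelectionStatus").GetText();
    Assert.That(text, Is.EqualTo("None"));
}
```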
How to suggest missing edge cases: look at the conditional logic in the fix — `if (x == null)`, `if (x <= 0)`, `try/catch` blocks — each branch suggests an edge case worth testing.

### 3. Test Type Appropriateness

Question: Is this the lightest test type that can verify the fix?
Preference order (lightest → heaviest):
| Priority | Type | When Appropriate | Project |
|---|---|---|---|
| ⭐ 1st | Unit Test | Pure logic, property changes, data transformations, binding behavior, event wiring | *.UnitTests.csproj |
| ⭐ 1st | XAML Test | XAML parsing, XamlC compilation, source generation, markup extensions | Controls.Xaml.UnitTests |
| ⭐⭐ 2nd | Device Test | Platform-specific rendering, native API interaction, handler mapping | *.DeviceTests.csproj |
| ⭐⭐⭐ 3rd | UI Test | User interaction flows, visual layout, screenshot comparison, end-to-end scenarios | TestCases.Shared.Tests |
Decision tree:

```
Does the test need to interact with visual UI elements?
├─ YES → Is it checking visual layout/appearance?
│        ├─ YES → UI test (VerifyScreenshot) ✅
│        └─ NO  → Could the interaction be tested via handler/control API?
│                 ├─ YES → Device test ⭐⭐
│                 └─ NO  → UI test ✅
└─ NO  → Does it need a platform/native context?
         ├─ YES → Device test ⭐⭐
         └─ NO  → Does it test XAML parsing/compilation?
                  ├─ YES → XAML test ⭐
                  └─ NO  → Unit test ⭐
```
Common "could be lighter" patterns:
| Current Test Does | Could Be Instead | Why |
|---|---|---|
| UI test: sets property, checks label text | Unit test | Property logic doesn't need UI |
| UI test: verifies event fires | Unit test | Event wiring is testable in isolation |
| UI test: checks control doesn't crash | Device test | Don't need Appium for crash testing |
| UI test: validates XAML binding | XAML test | Binding resolution is compile-time |
| Device test: checks property default | Unit test | Defaults don't need platform context |
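A sketch of the "property default" case as a plain xUnit unit test (the control and default value are illustrative, not taken from any particular PR):

```csharp
public class CollectionViewDefaultsTests
{
    [Fact]
    public void SelectedItem_IsNullByDefault()
    {
        var collectionView = new CollectionView();
        Assert.Null(collectionView.SelectedItem);  // no handler or platform context needed
    }
}
```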
### 4. Convention Compliance

Automated by the script. Review the script output for:
UI Tests:
- Test file named `IssueXXXXX.cs`
- `[Issue()]` attribute on the HostApp page
- `[Category()]` attribute — exactly ONE per test class (on the class or method, not both)
- Inherits from the `_IssuesUITest` base class
- `WaitForElement` before interactions
- No `Task.Delay`/`Thread.Sleep`
- No `#if ANDROID`/`#if IOS` conditionals
- No obsolete APIs (`Application.MainPage`, `Frame`, `Device.BeginInvokeOnMainThread`)
- `UITestEntry`/`UITestEditor` for screenshot tests

Unit Tests:
- `[Fact]` or `[Theory]` attributes (xUnit)

XAML Tests:
- `[Test]` with a `[Values]` `XamlInflator` parameter
- Issue repro classes named `MauiXXXXX`

### 5. Flakiness Risk

Question: Is this test likely to be flaky in CI?
| Risk Factor | Detection | Mitigation |
|---|---|---|
| Arbitrary delays | Task.Delay, Thread.Sleep | Use WaitForElement, retryTimeout |
| Missing waits | App.Tap without prior WaitForElement | Add explicit waits |
| Screenshot timing | VerifyScreenshot() without retryTimeout | Add retryTimeout: TimeSpan.FromSeconds(2) |
| Cursor blink | Entry/Editor in screenshot test | Use UITestEntry/UITestEditor |
| External URLs | WebView loading remote content | Use mock URLs or local content |
| Animation timing | Visual check after animation | Use retryTimeout |
| Global state | Test modifies Application.Current | Ensure cleanup in teardown |
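Several mitigations from the table combine into one pattern. A hedged sketch using the helpers named above (`WaitForElement`, `retryTimeout`); element names are hypothetical:

```csharp
// Avoid: Thread.Sleep(2000); App.Tap("Submit");
App.WaitForElement("Submit");                            // explicit wait instead of arbitrary delay
App.Tap("Submit");
App.WaitForElement("ResultLabel");                       // wait for the outcome, not a timer
VerifyScreenshot(retryTimeout: TimeSpan.FromSeconds(2)); // retry screenshot comparison
```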
### 6. Duplicate Coverage

Question: Does a similar test already exist?
Check the "Existing Similar Tests" section of the script output. If similar tests exist:
### 7. Platform Scope

Question: Does the test run on all platforms affected by the fix?

Check the "Platform Scope Analysis" from the script (e.g., whether the test is restricted by a platform-specific file suffix such as `.ios.cs`, or compiles for both).

### 8. Assertion Quality

Question: Are the assertions specific enough to catch regressions?
| Assertion Quality | Example | Verdict |
|---|---|---|
| ✅ Specific | Assert.That(label.Text, Is.EqualTo("Expected Value")) | Catches regression |
| ⚠️ Vague | Assert.That(label.Text, Is.Not.Null) | Too permissive |
| ❌ Meaningless | Assert.That(true) or no assertion | Proves nothing |
| ✅ Positional | Assert.That(rect.Y, Is.GreaterThan(safeAreaTop)) | Specific to layout fix |
| ⚠️ Brittle | Assert.That(rect.Y, Is.EqualTo(47)) | Magic number, will break |
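The last two rows, contrasted as code: a relational assertion survives device and screen differences where a magic number does not (element names are assumed, echoing the examples above):

```csharp
var rect = App.FindElement("HeaderLabel").GetRect();

// ✅ Specific but not brittle: ties the assertion to the invariant the fix guarantees
Assert.That(rect.Y, Is.GreaterThanOrEqualTo(safeAreaTop));

// ⚠️ Brittle: the exact pixel value varies by device, density, and OS version
// Assert.That(rect.Y, Is.EqualTo(47));
```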
### 9. Fix-Test Alignment

Question: Do the files changed by the fix align with what the test exercises?

Red flags:
- A test named `Issue12345` for a fix in `CollectionView` that only exercises `Label` rendering
- The fix changes `Shell.cs` but the test only navigates a `ContentPage`

## Output Format

Produce the evaluation report in this format:
```markdown
## PR Test Evaluation Report

**PR:** #XXXXX — [Title]
**Test files evaluated:** [count]
**Fix files:** [count]

---

### Overall Verdict

[One of: ✅ Tests are adequate | ⚠️ Tests need improvement | ❌ Tests are insufficient]

[1-2 sentence summary of the most important finding]

---

### 1. Fix Coverage — [✅/⚠️/❌]
[Does the test exercise the code paths changed by the fix?]

### 2. Edge Cases & Gaps — [✅/⚠️/❌]
**Covered:**
- [edge case 1]
- [edge case 2]

**Missing:**
- [gap 1 — describe what should be tested and why]
- [gap 2]

### 3. Test Type Appropriateness — [✅/⚠️/❌]
**Current:** [UI Test / Device Test / Unit Test / XAML Test]
**Recommendation:** [Same / Could be lighter — explain why]

### 4. Convention Compliance — [✅/⚠️/❌]
[Summary from automated checks — list only issues found]

### 5. Flakiness Risk — [✅ Low / ⚠️ Medium / ❌ High]
[Specific risk factors identified]

### 6. Duplicate Coverage — [✅ No duplicates / ⚠️ Potential overlap]
[Similar existing tests found, if any]

### 7. Platform Scope — [✅/⚠️/❌]
[Does test coverage match the platforms affected by the fix?]

### 8. Assertion Quality — [✅/⚠️/❌]
[Are assertions specific enough to catch the actual bug?]

### 9. Fix-Test Alignment — [✅/⚠️/❌]
[Do the test and fix target the same code paths?]

---

### Recommendations
1. [Most important actionable recommendation]
2. [Second recommendation]
3. [...]
```
| File | Description |
|---|---|
| `CustomAgentLogsTmp/TestEvaluation/context.md` | Automated context report from script |
| Problem | Cause | Solution |
|---|---|---|
| No changed files detected | Wrong base branch | Use -BaseBranch explicitly |
| No fix files detected | All changes are tests | Expected for test-only PRs |
| AutomationId mismatch | HostApp and test out of sync | Update one to match the other |
| Convention check false positive | Script regex too broad | Ignore and note in report |