Classify a paper's demonstrated capability into the three-tier frontier (reliable / sometimes / can't_yet) defined in research_guideline.md §5.1 Axis 3. This is a small skill invoked in a loop by the frontier-mapping pipeline, once per paper: given a single paper's task chain, reported success rate, and scope of evaluation, it classifies the demonstrated capability into one of the three tiers. The skill makes NO tool calls; it is pure reasoning on the provided input.
Input (one paper per invocation):

```json
{
  "paper_id": "arxiv:2501.12345",
  "task_chain": {
    "task": "Contact-rich peg-in-hole insertion of 3 object types",
    "problem": "...",
    "challenge": "...",
    "approach": "..."
  },
  "reported_results": {
    "success_rate": 0.85,
    "n_objects_tested": 3,
    "n_environments": 1,
    "real_robot": true,
    "perturbation_tested": false,
    "ablation_strong": true
  },
  "venue": "RSS",
  "year": 2025,
  "domain": "contact-rich manipulation"
}
```
Work through these questions in order:

1. Is there a demonstrated success rate in a realistic setting? If no → can't_yet.
2. Does the demonstration span diverse objects / environments / conditions? If no → sometimes (at best).
3. Is the reported success rate ≥ 90% with confidence intervals? If no → sometimes (or can't_yet if < 50%).
4. Has the capability been independently reproduced by other groups? If yes → reliable is possible; if no → sometimes.
5. Are there known "can't-yet" capabilities from research_guideline.md §5.1 Axis 3 that overlap with this task? If yes → sometimes or can't_yet unless the evidence is overwhelming.

Output:

```json
{
  "paper_id": "arxiv:2501.12345",
  "capability_description": "Contact-rich peg-in-hole insertion (3 peg types)",
  "tier": "sometimes",
  "evidence": [
    "Success rate 85% on 3 objects, single environment (§5.1 of paper)",
    "No perturbation tests reported",
    "Single-group result, no independent reproduction"
  ],
  "rationale": "Reported in a realistic setting with reasonable success, but the demonstration scope is narrow: 3 objects, 1 environment, no perturbation tests. Meets the 'sometimes' criteria but falls short of 'reliable', which requires diverse conditions and independent reproduction.",
  "confidence": "high",
  "boundary_notes": "If the authors extend evaluation to 10+ objects with perturbation tests, upgrade to borderline reliable."
}
```
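The decision procedure above can be sketched in Python. This is a minimal illustration, not part of the skill itself (the skill is pure reasoning, not code). The field names follow the input example; the `independently_reproduced` field and the `known_cant_yet_overlap` flag are NOT in the input schema and stand in for checks the classifier performs by reading the paper and the guideline. The diversity thresholds (10+ objects, 2+ environments) echo the boundary_notes example and are otherwise illustrative.

```python
def classify_tier(reported_results, known_cant_yet_overlap=False):
    """Sketch of the §5.1 Axis 3 decision procedure (illustrative only)."""
    r = reported_results

    # Q1: no demonstrated success rate in a realistic setting -> can't_yet.
    if r.get("success_rate") is None or not r.get("real_robot"):
        return "can't_yet"

    # Q3 (lower bound): a success rate below 50% is can't_yet outright.
    if r["success_rate"] < 0.5:
        return "can't_yet"

    # Q2: narrow scope (few objects, one environment, no perturbation
    # tests) caps the tier at "sometimes". Thresholds are assumptions.
    diverse = (
        r.get("n_objects_tested", 0) >= 10
        and r.get("n_environments", 0) >= 2
        and r.get("perturbation_tested", False)
    )

    # Q3-Q5: "reliable" additionally needs >= 90% success, independent
    # reproduction, and no overlap with known can't-yet capabilities.
    if (
        diverse
        and r["success_rate"] >= 0.9
        and r.get("independently_reproduced", False)
        and not known_cant_yet_overlap
    ):
        return "reliable"

    return "sometimes"
```

Running this on the example input above yields "sometimes" for the same reason the example rationale gives: 85% success on 3 objects in 1 environment passes the realistic-setting gate but fails the diversity gate.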
tier is one of "reliable", "sometimes", "can't_yet".
confidence indicates how confident the classification is:

- high — clear-cut case, fits the heuristics well
- medium — borderline between two tiers
- low — insufficient data to classify; caller should flag for human review

You CAN classify with reasonable confidence based on the reported scope and success rate alone. This is a LIGHTWEIGHT skill: the frontier-mapping pipeline calls you many times and aggregates the results. It is not a substitute for the researcher's own judgment of the field.
You CANNOT assess:
When in doubt between two tiers, pick the more conservative (can't_yet
or sometimes rather than reliable). False-positive "reliable" claims
are more damaging to the frontier map than conservative "sometimes"
claims.
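The conservative tie-break can be stated mechanically: order the tiers from most to least conservative and take the lower of the two candidates. A tiny sketch (the ordering is from the tier definitions; the helper name is hypothetical):

```python
# Tiers ordered from most conservative to least conservative.
TIER_ORDER = ["can't_yet", "sometimes", "reliable"]

def more_conservative(tier_a, tier_b):
    """Return whichever of two candidate tiers sits lower in TIER_ORDER."""
    return min(tier_a, tier_b, key=TIER_ORDER.index)
```

So a borderline reliable/sometimes call resolves to "sometimes", and a borderline sometimes/can't_yet call resolves to "can't_yet".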
- guidelines/doctrine/research_guideline.md §5.1 Axis 3 — capability frontier (primary source for tier definitions)
- guidelines/doctrine/research_guideline.md §1.5 — the long tail of the physical world (context for why "sometimes" is a large category)