Iterative rubric-based evaluation and self-improvement loop. Builds a scoring rubric interactively, evaluates an artifact with multiple models in parallel (Codex, Gemini, Claude), then autonomously improves the artifact one criterion at a time until the score threshold is met or the circuit breaker fires. "/rulph", "rubric evaluate", "rubric score", "multi-model evaluate", "score and improve", "evaluate and iterate", "grade this", "루브릭 루프", "채점 루프", "자율 개선", "개선 루프", "루브릭 평가"
Iterative self-improvement skill driven by a user-defined rubric. Builds a scoring rubric interactively, evaluates an artifact with multiple models in parallel, then loops autonomously — improving one criterion at a time — until the score meets the threshold or the circuit breaker fires. No user interaction after Phase 1.
## Phase 1: Interactive Rubric Building

Build an evaluation rubric through a 3-step interactive process before any scoring begins.
User interaction: Use the AskUserQuestion tool for all user-facing questions in this skill. This ensures the UI renders properly and waits for real user input.
Use AskUserQuestion to ask the user what they are evaluating and which criteria matter. Suggest common categories (code quality, writing quality, system design) but let them describe freely.
After the user responds, parse the artifact description and the list of criteria from their answer.
Require a minimum of 2 criteria. If fewer than 2 are given, prompt again:
"Please provide at least 2 criteria so we can triangulate quality. What else matters?"
Rubric Validation — before proceeding, check that each criterion is objectively scorable. If a criterion is vague or subjective, warn:
"Warning: '[criterion]' is hard to score objectively. Consider rewording to something measurable, e.g., 'visual hierarchy is clear and consistent'."
Generate a rubric draft based on the collected criteria. Assign equal weights by default.
Checklist Decomposition (default): Break each criterion into 5–10 yes/no sub-items. Score is computed as (checked / total) × 100. This reduces evaluator interpretation variance.
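A minimal sketch of the arithmetic, with illustrative names (not part of the skill):

```python
# Checklist Decomposition: each sub-item is a yes/no check.
def criterion_score(checklist: dict[str, bool]) -> float:
    """score = (checked / total) x 100"""
    checked = sum(1 for passed in checklist.values() if passed)
    return round(checked / len(checklist) * 100, 1)

# 4 of 5 sub-items checked -> 80.0
print(criterion_score({"a": True, "b": True, "c": True, "d": True, "e": False}))
```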
Present the draft as a table with sub-items:
## Draft Rubric
| # | Criterion | Weight | Sub-items (yes/no each) |
|---|-----------------|--------|------------------------------------------------------------|
| 1 | [criterion] | 25% | □ [sub-item-1] · □ [sub-item-2] · □ [sub-item-3] |
| | | | □ [sub-item-4] · □ [sub-item-5] |
| | | | score = (checked / 5) × 100 |
| 2 | [criterion] | 25% | □ [sub-item-1] · □ [sub-item-2] · □ [sub-item-3] |
| | | | □ [sub-item-4] · □ [sub-item-5] · □ [sub-item-6] |
| | | | score = (checked / 6) × 100 |
Sub-item design rules: each sub-item must be answerable yes/no, concretely observable in the artifact, and independent of the other sub-items for its criterion.
Qualitative fallback: If a criterion genuinely cannot be decomposed into sub-items (e.g., "writing tone"), use level-based anchors instead:
| # | Criterion | Weight | Scoring Guidance (0–100) |
|---|-----------------|--------|------------------------------------------------------------|
| N | [criterion] | 25% | 0=absent · 25=minimal · 50=partial · 75=good · 100=full |
Level-based anchors must have 5 levels (0/25/50/75/100) with one concrete observable indicator per level. 3-level anchors (0/50/80) are too coarse.
Each criterion gets: a weight, plus either a yes/no sub-item checklist (default) or 5-level anchors (qualitative fallback).
Then use AskUserQuestion to confirm or modify (accept, adjust weights, edit criteria, or start over). Loop until the user accepts.
Weight validation: After any adjustment, verify sum(weights) == 100% (±1% tolerance for rounding). If invalid, prompt:
"Weights must sum to 100%. Current sum: [X]%. Please redistribute." Re-present the rubric table until weights are valid.
Use AskUserQuestion to ask two things:
Overall threshold (0–100): what overall score the artifact should reach before stopping. Suggest 70/80/90 as options. Default is 70 if the user doesn't specify.
Per-criterion floor (0–100): the minimum score that EACH individual criterion must meet, regardless of overall score. Suggest 50/60 as options. Default is 60 if the user doesn't specify. Set to 0 to disable.
Why floor matters: Without a floor, strong criteria can mask weak ones (e.g., overall 80 passes threshold 70, but one criterion scores 50). The floor ensures every dimension meets a minimum bar.
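A worked example of the masking effect, with illustrative numbers (threshold 70, floor 60):

```python
# Two strong criteria mask one weak criterion in the weighted overall.
scores  = {"clarity": 95, "correctness": 95, "coverage": 50}
weights = {"clarity": 40, "correctness": 40, "coverage": 20}  # percentages

overall = sum(scores[c] * weights[c] / 100 for c in scores)  # 86.0
passes_threshold = overall >= 70                              # True
passes_floor = all(s >= 60 for s in scores.values())          # False: coverage = 50

# 86.0 True False: the floor blocks PASSED despite overall >= threshold
print(overall, passes_threshold, passes_floor)
```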
Display the final rubric before Phase 2 begins:
## Evaluation Contract
**Target**: [artifact description or path]
**Threshold**: [threshold]/100
**Per-criterion floor**: [floor]/100
**Max rounds**: 5
**Scoring method**: Checklist Decomposition
| # | Criterion | Weight | Sub-items | Formula |
|---|-------------|--------|----------------------------------------|----------------------|
| 1 | [criterion] | [W]% | □ A · □ B · □ C · □ D · □ E | (checked/5) × 100 |
| 2 | [criterion] | [W]% | □ A · □ B · □ C · □ D | (checked/4) × 100 |
...
Pass condition: overall >= [threshold] AND every criterion >= [floor]
Rubric locked. Starting evaluation.
State init — write the loop state so the Stop hook can track progress. The state file is session-scoped to prevent cross-session interference:
Bash: SESSION_ID="[session ID from UserPromptSubmit hook]" && sr-harness-cli session set --sid "$SESSION_ID" --json '{"rulph": {"round": 0, "max_rounds": 5, "score": 0, "threshold": [threshold], "status": "active", "iteration": 0, "max_iterations": 15}}'
Replace [threshold] with the actual threshold value. The state is stored under the .rulph key in the session-scoped state.json. This file is read by the Stop hook to decide whether the loop should continue. The iteration/max_iterations fields are the Stop hook's safety counter — always preserve them in subsequent state updates.
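If the state ever has to be patched by hand instead of through sr-harness-cli, a read-modify-write that keeps the safety counter intact might look like this sketch (the state.json path under the session directory is an assumption inferred from the Phase 4 tmp layout):

```python
import json
from pathlib import Path

session_id = "example-session"  # from the UserPromptSubmit hook

# Assumed location, inferred from the $HOME/.sr-harness/$SESSION_ID layout.
state_path = Path.home() / ".sr-harness" / session_id / "state.json"

state = json.loads(state_path.read_text())
rulph = state.setdefault("rulph", {})
rulph.update({"round": 1, "score": 74, "status": "active"})
# Never drop the Stop hook's safety counter fields.
rulph.setdefault("iteration", 0)
rulph.setdefault("max_iterations", 15)
state_path.write_text(json.dumps(state, indent=2))
```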
## Phase 2: Multi-Model Evaluation

Score the artifact independently using up to 3 models in parallel.
Before scoring, check which CLIs are available:
Bash: command -v codex; command -v gemini (check each independently — chaining with && would skip the gemini check whenever codex is missing)
Model states: AVAILABLE (CLI found) / SKIPPED (not found) / DEGRADED (found but call failed).
Note: The 3rd evaluator (Claude) runs as a subagent — no CLI check needed.
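A sketch of the availability mapping (illustrative, Python instead of the Bash check above):

```python
import shutil

# AVAILABLE if the CLI is on PATH, SKIPPED otherwise; DEGRADED is
# assigned later, if an AVAILABLE CLI fails when actually called.
states = {cli: "AVAILABLE" if shutil.which(cli) else "SKIPPED"
          for cli in ("codex", "gemini")}
states["claude"] = "AVAILABLE"  # subagent, no CLI lookup needed
print(states)
```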
Score isolation rule: Pass only the current artifact content to each model. Do NOT include previous round scores, improvement history, or prior evaluation feedback.
Each evaluator receives the same prompt template with the rubric, artifact content, and required JSON output format:
## Rubric Evaluation Task
You are a strict evaluator. Score the artifact below using the provided rubric.
For each criterion, check every sub-item (yes/no) and compute: score = (checked / total) × 100.
Return ONLY a JSON object — no prose before or after.
## Rubric
[criterion list with weights and sub-items checklist]
## Artifact
[Full artifact content — read the file]
## Required Output Format
{
"scores": { "[criterion]": <0-100>, ... },
"checklist": { "[criterion]": { "[sub-item-1]": true/false, "[sub-item-2]": true/false, ... }, ... },
"suggestions": { "[criterion]": "<one concrete action targeting an unchecked sub-item>", ... }
}
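A minimal sketch of validating one evaluator's reply before aggregation (function and variable names are illustrative):

```python
import json

def validate_result(raw: str, rubric: dict[str, list[str]]) -> dict:
    """Parse an evaluator's JSON reply and recompute each score from its
    checklist, so a model cannot report a score its checks do not support.
    rubric maps criterion name -> list of sub-item names."""
    result = json.loads(raw)  # fails loudly if the model added prose
    for criterion, sub_items in rubric.items():
        checks = result["checklist"][criterion]
        recomputed = sum(bool(checks[s]) for s in sub_items) / len(sub_items) * 100
        if abs(result["scores"][criterion] - recomputed) > 1:
            result["scores"][criterion] = recomputed  # the checklist wins
    return result
```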
Launch all 3 evaluators in a single message using run_in_background: true:
# All 3 in ONE message — true parallel execution
Agent(subagent_type="general-purpose", run_in_background=true,
description="Codex evaluator",
prompt="Run: codex exec <<'PROMPT'\n[evaluation prompt with rubric + artifact]\nPROMPT")
Agent(subagent_type="general-purpose", run_in_background=true,
description="Gemini evaluator",
prompt="Run: gemini -p \"$(cat <<'PROMPT'\n[evaluation prompt with rubric + artifact]\nPROMPT)\"\n")
Agent(subagent_type="general-purpose", run_in_background=true,
description="Claude evaluator",
prompt="[evaluation prompt with rubric + artifact — subagent evaluates directly]")
After launching, wait for all 3 to complete (check TaskOutput for each background agent). Then proceed to Score Aggregation.
After all models complete (or fail):
Minimum model guarantee: If all 3 evaluators fail (both CLIs plus the Claude subagent), fall back to main agent self-evaluation as a last resort. Score aggregation is guaranteed to have at least one model result.
Low confidence flag: If only 1 model is AVAILABLE, flag the round as LOW CONFIDENCE in the inline display. Single-model scores lack cross-validation.
overall = sum(criterion_avg[i] * weight[i] / 100) for all i, where criterion_avg[i] is the mean of criterion i's scores across AVAILABLE models and weight[i] is the rubric's percentage weight
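An aggregation sketch, assuming percentage weights and illustrative names:

```python
def aggregate(model_scores: dict[str, dict[str, float]],
              weights: dict[str, float]) -> tuple[dict[str, float], float]:
    """model_scores: model name -> {criterion: score}, AVAILABLE models only.
    Returns per-criterion averages and the weighted overall score."""
    avg = {c: sum(m[c] for m in model_scores.values()) / len(model_scores)
           for c in weights}
    overall = sum(avg[c] * weights[c] / 100 for c in weights)
    return avg, overall

avg, overall = aggregate(
    {"codex":  {"clarity": 80, "coverage": 60},
     "claude": {"clarity": 90, "coverage": 70}},
    {"clarity": 50, "coverage": 50})
print(avg, overall)  # {'clarity': 85.0, 'coverage': 65.0} 75.0
```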
Inline display:
📊 Score: XX/100 (Codex: XX | Gemini: XX | Claude: XX) — Threshold: [threshold] · Floor: [floor]
[criterion_1]: XX (Codex: XX, Gemini: XX, Claude: XX)
[criterion_2]: XX (Codex: XX, Gemini: XX, Claude: XX) ⚠️ BELOW FLOOR
...
Model status: Codex=AVAILABLE · Gemini=SKIPPED · Claude=AVAILABLE
Floor violations: [list of criteria below floor, or "None"]
Convergence / Divergence Analysis:
If any two models differ by more than 20 points on the same criterion:
"Warning: Model disagreement on '[criterion]' (gap: XX pts). Scores may reflect differing interpretations of the rubric. Consider clarifying the scoring anchor for this dimension."
Improvement Suggestion Synthesis:
Collect suggestions from all AVAILABLE models. Prioritize the criterion with the lowest average score. Present the top suggestion per criterion, labeled by source model.
State update — after every scoring round, update the session-scoped state file (preserve iteration/max_iterations for the Stop hook's safety counter):
Bash: SESSION_ID="[session ID from UserPromptSubmit hook]" && sr-harness-cli session set --sid "$SESSION_ID" --json '{"rulph": {"round": [round], "score": [overall], "threshold": [threshold], "status": "active", "iteration": 0}}'
Replace [round], [overall], etc. with actual values. Note: iteration resets to 0 here — the Stop hook increments it each time it fires within a round, providing a per-round safety net.
## Phase 3: Autonomous Improvement Loop

Iteratively improve the artifact one criterion at a time until the threshold is met or the circuit breaker fires. No user interaction in this phase — the loop runs autonomously.
Initialize: round = 1, max_rounds = 5, score_history = []
The initial Phase 2 scoring produces baseline scores. Phase 3 then runs this loop:
LOOP:
1. Pass Check → if overall >= threshold AND all criteria >= floor → Phase 4 (PASSED)
2. Circuit Breaker → if round > max_rounds → Phase 4 (CIRCUIT BREAKER)
3. Improvement Dispatch (improve lowest criterion — floor violations first)
4. Re-score (return to Phase 2)
5. Append to score_history, round += 1
6. Repeat from 1
below_floor = [c for c in criteria if c.score < floor]
if overall >= threshold and not below_floor:
    proceed_to_phase_4("PASSED")  # pass check satisfied, stop immediately
elif below_floor:
    # Floor priority: violations take precedence over the overall threshold
    target = min(below_floor, key=lambda c: c.score)  # lowest below-floor criterion
    log(f"Floor violation: {target.name} at {target.score} < floor {floor}. Auto-targeting for improvement.")
else:
    # overall < threshold and no floor violations: original behavior
    target = min(criteria, key=lambda c: c.score)  # lowest criterion overall
Floor priority: Floor violations take precedence over overall threshold. Even if overall >= threshold, a below-floor criterion blocks PASSED and triggers improvement.
if round > max_rounds:
→ Proceed to Phase 4 immediately (result: CIRCUIT BREAKER)
Select the single lowest-scoring criterion (prevents scope creep). If multiple criteria tie for the lowest score, pick the one with the higher weight (greater impact on overall score).
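A sketch of the full selection rule, combining floor priority with the weight tie-break (illustrative structure):

```python
def pick_target(criteria: list[dict], floor: float) -> dict:
    """Floor violations first; otherwise the lowest-scoring criterion.
    Ties break toward the higher weight (bigger impact on overall)."""
    below_floor = [c for c in criteria if c["score"] < floor]
    pool = below_floor or criteria
    return min(pool, key=lambda c: (c["score"], -c["weight"]))

target = pick_target(
    [{"name": "clarity",  "score": 60, "weight": 25},
     {"name": "coverage", "score": 60, "weight": 50},
     {"name": "depth",    "score": 80, "weight": 25}],
    floor=50)
print(target["name"])  # coverage: tied lowest score, higher weight
```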
Dispatch a worker agent:
Agent(subagent_type="worker",
prompt="## Improvement Task — Round [round]
## Artifact
Location: [artifact file path or content block]
## Target Criterion
[criterion name]: current score [score]/100
Weight: [W]%
## Unchecked Sub-items (fix these)
[List each unchecked sub-item from the checklist — these are the specific gaps to close]
## Improvement Instructions
[Synthesized suggestions from all AVAILABLE models for this criterion]
## Constraint
Improve ONLY this criterion. Focus on the unchecked sub-items listed above.
Do not restructure or rewrite unrelated sections.
Return the improved artifact to the same location.")
After the worker completes:
score_history.append({ round, overall, per_criterion_scores, model_states })
round += 1
State update — once the loop exits to Phase 4, mark the run as completed so the Stop hook allows exit:
Bash: SESSION_ID="[session ID from UserPromptSubmit hook]" && sr-harness-cli session set --sid "$SESSION_ID" --json '{"rulph": {"status": "completed"}}'
## Phase 4: Final Report

Display the complete evaluation summary:
## Rulph Final Report
**Artifact**: [artifact description or path]
**Rubric**: [N] criteria · threshold [threshold]/100 · floor [floor]/100
**Result**: [PASSED / CIRCUIT BREAKER]
### Score History
| Round | Overall | [C1] | [C2] | ... | Models Used |
|-------|---------|------|------|-----|---------------------|
| 1 | XX | XX | XX | ... | Codex, Claude |
| 2 | XX | XX | XX | ... | Codex, Claude |
| ... | | | | | |
| N | XX | XX | XX | ... | Codex, Claude |
### Final Scores (Round [N])
| Criterion | Weight | Score | Top Suggestion |
|-------------|--------|-------|---------------------------------------|
| [criterion] | [W]% | XX | [best suggestion from last round] |
| ... | | | |
**Overall: [final_score]/100**
[PASSED threshold of [threshold] ✓ / Did not reach threshold — stopped at round N]
Always save the rubric and scores automatically. Include the full report in the saved file.
SESSION_ID="[session ID from UserPromptSubmit hook]"
REPORT_DIR="$HOME/.sr-harness/$SESSION_ID/tmp/rulph"
Bash: mkdir -p "$REPORT_DIR"
Write to $REPORT_DIR/$(date +%Y-%m-%d-%H%M%S)-report.md:
[Full rubric definition]
[Score history table]
[Final scores table]
[Model availability log per round]
Close with:
"Finished! Final score: [final_score]/100 after [N] round(s). Report saved to session tmp."
Heredoc safety: pass the evaluation prompt to Codex via a quoted heredoc (codex exec <<'PROMPT' ... PROMPT). For Gemini, use gemini -p "$(cat <<'PROMPT' ... PROMPT)" to prevent shell injection. The Claude evaluator runs as a subagent, so no CLI escaping is needed.