Use this skill when the user wants to debug, diagnose, or systematically iterate on an experiment that already exists, or when they need a structured experiment log for tracking runs, hypotheses, failures, results, and next steps during active research. Apply it to underperforming methods, training that will not converge, regressions after a change, inconsistent results across datasets, aimless experimentation without progress, and questions like 'why doesn't this work?', 'no progress after many attempts', or 'how should I investigate this failure?'. Also use it for setting up practical experiment logging/record-keeping that supports debugging and iteration. Do not use it for designing a brand-new experiment pipeline or full experiment program (use experiment-pipeline), generating research ideas, fixing isolated coding/syntax errors, or writing retrospective summaries into research memory/notes/knowledge bases.
A systematic approach to running, debugging, and iterating on research experiments. The critical skill is not running more experiments — it's understanding WHY experiments fail.
This skill is typically loaded from within `experiment-pipeline` when a stage attempt fails. After debugging, return to the pipeline's stage-gate structure to continue. It can also be used standalone for any experiment debugging.
Finding WHY experiments fail is the most critical research skill. Failing to analyze results leads to two failure modes:
The goal is not to run more experiments. The goal is to run the RIGHT experiments — ones that isolate causes and test specific hypotheses.
When an experiment fails or produces unexpected results, follow these five steps:
Gather concrete examples of bad results. Look at the actual outputs, not just aggregate metrics. What specifically went wrong? Are the failures systematic or random?
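One way to check whether failures are systematic or random is to bucket the bad cases by a hypothesized failure category and look at the counts. A minimal sketch, where the `classify` function and the category labels are assumptions you supply for your own task, not part of this skill:

```python
from collections import Counter

def categorize_failures(examples, classify):
    """Count failure categories, most common first.

    A few dominant categories suggest a systematic failure;
    a flat distribution suggests noise or many small causes.
    """
    counts = Counter(classify(ex) for ex in examples)
    return counts.most_common()

# Illustrative: tag empty model outputs vs. everything else.
classify = lambda ex: "empty_output" if not ex else "other"
print(categorize_failures(["", "", "x"], classify))
# → [('empty_output', 2), ('other', 1)]
```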
You need a baseline that works. Two ways to find one:
If you can't find any working version, simplify further until something works. There is always a simple enough version that works.
Starting from the working version, incrementally add complexity until it breaks:
This step isolates the cause. Without it, you're guessing.
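Steps 2 and 3 can be sketched as a forward bisection: start from the working baseline and enable one change at a time until the metric drops. Everything below (`run_experiment`, the component names, the threshold) is a hypothetical stand-in for your own pipeline:

```python
def isolate_breaking_change(components, run_experiment, threshold):
    """Enable components one at a time on top of a working baseline.

    Returns the first component whose addition drops the metric below
    `threshold`, or None if the full configuration still works.
    `run_experiment` takes a {component: bool} config and returns a metric
    where higher is better.
    """
    config = {name: False for name in components}  # the working baseline
    for name in components:
        config[name] = True
        if run_experiment(config) < threshold:
            return name  # the isolated cause
    return None

# Illustrative fake runner: the new loss term degrades the metric.
fake_run = lambda cfg: 0.9 - (0.3 if cfg["new_loss"] else 0.0)
culprit = isolate_breaking_change(
    ["augmentation", "new_loss", "scheduler"], fake_run, threshold=0.8
)
# → culprit == "new_loss"
```

The order of `components` matters when changes interact; if enabling them one at a time never reproduces the failure, the cause is an interaction and you should enable pairs next.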
Based on the isolated cause from Step 3:
Based on the confirmed cause:
research-ideation skill)

See references/debugging-methodology.md for detailed branching logic and a cause taxonomy.
Prioritize these rules during experimental work:
Every experiment should be logged with five sections. Use the template at assets/experiment-log-template.md.
| Section | What to Record |
|---|---|
| Purpose | Why you're running this experiment; what you expect to learn |
| Setting | Data, algorithm changes, hyperparameters — everything needed to reproduce |
| Results | Quantitative metrics + qualitative observations + specific good/failure cases |
| Analysis | Do results match expectations? If not, hypothesized causes ranked by likelihood |
| Next Steps | What to do based on the analysis — YOU are the project leader |
The "Next Steps" section is the most important. Don't wait for someone to tell you what to do next. Analyze your results and propose the next experiment yourself. This is what distinguishes a researcher from a technician.
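If you want the five sections enforced mechanically, here is a minimal sketch of a log-entry renderer. The function name and markdown layout are assumptions for illustration; the canonical template is assets/experiment-log-template.md:

```python
from datetime import date

SECTIONS = ["Purpose", "Setting", "Results", "Analysis", "Next Steps"]

def render_log_entry(title, **sections):
    """Render a markdown experiment-log entry with all five sections.

    Sections are passed as keyword args (e.g. next_steps="...");
    missing ones are emitted as 'TODO' so gaps stay visible, not silent.
    """
    lines = [f"## {title} ({date.today().isoformat()})"]
    for name in SECTIONS:
        key = name.lower().replace(" ", "_")
        lines.append(f"### {name}")
        lines.append(sections.get(key, "TODO"))
    return "\n".join(lines)

entry = render_log_entry(
    "lr sweep",
    purpose="Find the largest stable learning rate",
    next_steps="Rerun best lr with 3 seeds",
)
```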
Cross-cycle learning: If using `experiment-pipeline`, your experiment logs feed into `evo-memory`'s ESE (Experiment Strategy Evolution) mechanism. Tag reusable strategies with `[Reusable]` so ESE can extract them for future cycles.
After completing the 5-step diagnostic flow, return to experiment-pipeline with:
When experiments succeed and you have a complete set of results, pass these artifacts to paper-writing:
| Artifact | Source | Used By |
|---|---|---|
| Final experiment results (tables and figures) | Experiment logs | Experiments section |
| Ablation study results | Diagnostic experiments | Ablation tables |
| Failure case analysis | Step 1 + Step 3 | Limitations discussion |
| Key implementation details and tricks | Steps 3-5 | Method section / Supplementary |
| Baseline comparison results | Step 2 | Comparison tables |
| Topic | Reference File | When to Use |
|---|---|---|
| Debugging methodology | debugging-methodology.md | Diagnosing why experiments fail |
| Experiment log template | experiment-log-template.md | Recording experiment details |