Experiment-planning skill for research papers in systems, networking, and AI. Use when Codex must design or audit baselines, metrics, workloads, ablations, statistical checks, scaling studies, sensitivity analysis, and failure tests so that a paper's claims are actually supported.
Read ../references/workflow.md, ../references/venues.md, and ../references/memory.md.
Before proposing new experiments, load relevant project memory if it exists, so that established evaluation rules, baseline policies, and previously failed directions are not rediscovered from scratch.
Design evaluation from claims backward: start from what the paper asserts and derive the evidence each assertion requires.
For each claim, specify:
- the claim as a falsifiable statement,
- the metric(s) that would support or refute it,
- the baseline(s) it must beat or match,
- the workload(s) or dataset(s) it is measured on,
- the statistical check that decides whether the difference is real.
Produce an experiment matrix with columns: claim, experiment, metric, baselines, workload/dataset, configuration, seeds and repetitions, and status.
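The experiment matrix can be generated mechanically from the claims and axes rather than maintained by hand. A minimal sketch using only Python's standard library; the claim text, workloads, metric, and column names below are illustrative placeholders, not a prescribed schema:

```python
import csv
import io
import itertools

# Illustrative claim and evaluation axes; real values come from the paper's claims.
claims = ["C1: system X reduces p99 latency vs baseline B"]
workloads = ["uniform", "skewed"]
seeds = [0, 1, 2]

# One row per (claim, workload, seed) combination.
rows = [
    {"claim": c, "workload": w, "seed": s,
     "metric": "p99_latency_ms", "baseline": "B", "status": "planned"}
    for c, w, s in itertools.product(claims, workloads, seeds)
]

# Serialize as CSV so the matrix can be tracked alongside the paper.
buf = io.StringIO()
writer = csv.DictWriter(buf, fieldnames=list(rows[0]))
writer.writeheader()
writer.writerows(rows)
print(len(rows))  # 1 claim x 2 workloads x 3 seeds = 6 rows
```

Generating the matrix this way makes gaps visible: any (claim, workload, seed) cell that is never filled in is an unsupported part of a claim.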
Default rigor checks:
- multiple seeds or runs, with variance (not just means) reported,
- baselines tuned with effort comparable to the proposed system,
- identical hardware, software versions, and workload settings across comparisons,
- ablations that isolate the contribution of each component,
- results reported on data or workloads not used for tuning.
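A statistical check over per-seed results can be sketched with a paired percentile bootstrap, assuming Python's standard library; the per-seed numbers below are hypothetical placeholders, not real measurements:

```python
import random
import statistics

# Hypothetical per-seed throughputs (higher is better); replace with real runs.
system = [112.0, 109.5, 114.2, 110.8, 113.1]
baseline = [104.3, 106.1, 103.8, 105.5, 104.9]

def bootstrap_ci(diffs, iters=10_000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for the mean of paired differences."""
    rng = random.Random(seed)
    means = sorted(
        statistics.fmean(rng.choices(diffs, k=len(diffs)))
        for _ in range(iters)
    )
    return means[int(alpha / 2 * iters)], means[int((1 - alpha / 2) * iters) - 1]

diffs = [s - b for s, b in zip(system, baseline)]
lo, hi = bootstrap_ci(diffs)
# The improvement claim is supported only if the whole interval excludes zero.
print(lo > 0)
```

Reporting the interval itself, rather than a single mean, is what lets a reviewer judge whether the claimed improvement survives run-to-run variance.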
Flag common paper-killing problems:
- untuned or outdated baselines,
- single-seed or single-run results presented as conclusive,
- metric or workload cherry-picking,
- leakage between tuning and evaluation data,
- claims broader than the experiments support,
- missing ablations for key design choices.
When the session establishes a durable evaluation rule, baseline policy, or failed direction that should not be relearned next time, propose a project-memory entry for it.