Skill File

Downstream Agent Skills Generator

Name: Downstream Agent Skills Generator
Author: conradry

Auto-generates a SKILL.md file for a processed dataset so future Claude Code sessions can discover and analyze it. Use after preprocessing succeeds and validation passes, or when a user says "create a skill for this dataset", "make this dataset available in future sessions", or "register this processed data".

conradry · 0 stars · 2026-03-08

Occupation
Category: Scripting

Skill Content

Purpose

Auto-generates a SKILL.md file for a processed dataset so that future Claude Code sessions can "know" about the dataset and what analyses are available. This enables persistent dataset awareness across sessions.

When to Use

Invoke after dataset-preprocessing-workflow completes successfully and result-schema-validator passes. Generates a skill file that describes the processed dataset.

Workflow Steps

Step 1: Read Processed Dataset Metadata

Load the processed h5ad file and extract metadata:

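import anndata as ad

# Load the processed dataset produced by the preprocessing workflow
adata = ad.read_h5ad("<processed.h5ad>")
# Preprocessing metadata is stored under adata.uns["preprocessing"]
metadata = adata.uns["preprocessing"]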

Key metadata to extract:

- Cell count, gene count
- Perturbation key and available perturbations
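The extraction described above can be sketched as a small helper. This is a minimal sketch under stated assumptions, not the skill's actual implementation: the function name `summarize_dataset` is hypothetical, and the `"perturbation_key"` entry inside `adata.uns["preprocessing"]` is an assumed name, because the source does not list the keys stored there. The helper relies only on the `n_obs`, `n_vars`, `obs`, and `uns` attributes that AnnData objects expose.

```python
def summarize_dataset(adata):
    """Collect the Step 1 metadata from an AnnData-like object."""
    prep = adata.uns.get("preprocessing", {})
    # ASSUMPTION: the preprocessing step recorded the obs column name under
    # "perturbation_key"; the real key name is not given in the source.
    pert_key = prep.get("perturbation_key")
    perturbations = []
    if pert_key is not None and pert_key in adata.obs:
        # Unique condition labels, sorted for a stable listing
        perturbations = sorted({str(p) for p in adata.obs[pert_key]})
    return {
        "n_cells": adata.n_obs,
        "n_genes": adata.n_vars,
        "perturbation_key": pert_key,
        "perturbations": perturbations,
    }
```

If the assumed key is absent, the helper returns an empty perturbation list rather than raising, so the caller can decide whether to stop or ask the user for the column name.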

Step 2: Generate Skill Content

Fill in the following SKILL.md template with the metadata extracted in Step 1:

# Dataset: <dataset_name>

## Source
- **Paper**: <title, DOI>
- **Data accession**: <GEO ID>
- **Perturbation type**: <chemical|genetic_crispr|genetic_rnai|combinatorial>

## Dataset Summary
- **Cells**: <N> (after QC)
- **Genes**: <N> (HVGs)
- **Perturbations**: <list of perturbation conditions>
- **Control**: <control label>
- **Clusters**: <N>
- **DE performed**: yes/no

## File Location
`<path/to/processed.h5ad>`

## Available Analyses
Based on preprocessing results, these analyses are available:

### Quick Queries
- "Show top DE genes for <perturbation> vs <control>"
- "Compare clusters between <perturbation_1> and <perturbation_2>"
- "Show UMAP colored by perturbation condition"

### Deeper Analysis
- Pathway enrichment on DE genes
- Perturbation similarity/clustering
- Cell type composition changes per perturbation
- <route-specific analyses from perturbation-type-router>

## How to Load
```python
import scanpy as sc
adata = sc.read_h5ad("<path>")
# Access DE results: adata.uns["rank_genes_groups"]
# Access embeddings: adata.obsm["X_umap"], adata.obsm["X_pca"]
# Access raw counts: adata.layers["counts"]
```
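Filling the template can be sketched with plain string formatting. The function name `render_skill_md` and the keys of the `meta` dictionary are hypothetical; only the headings and field labels come from the template. For brevity, this sketch renders just the Dataset Summary and File Location sections.

```python
def render_skill_md(meta: dict) -> str:
    """Render part of the SKILL.md template from a metadata dict."""
    lines = [
        f"# Dataset: {meta['dataset_name']}",
        "",
        "## Dataset Summary",
        f"- **Cells**: {meta['n_cells']} (after QC)",
        f"- **Genes**: {meta['n_genes']} (HVGs)",
        f"- **Perturbations**: {', '.join(meta['perturbations'])}",
        f"- **Control**: {meta['control']}",
        "",
        "## File Location",
        f"`{meta['path']}`",
    ]
    # End with a newline so the file is well-formed text
    return "\n".join(lines) + "\n"
```

The remaining sections (Source, Available Analyses, How to Load) could be appended to `lines` in the same way.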


### Step 3: Write Skill File
Save the generated SKILL.md to a dataset-specific skill directory. The index example in Step 4 uses the layout `.claude/skills/datasets/<dataset_name>/SKILL.md`.


Naming convention for `<dataset_name>`:
- Use GEO accession if available: `GSE12345`
- Otherwise: `<first_author>_<year>_<perturbation_type>`
- Lowercase, underscores, no spaces
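The naming convention and the write step can be sketched as follows. Both function names are hypothetical, and `skills_root` is a parameter rather than a fixed location. The sketch keeps a GEO accession unchanged (matching the `GSE12345` example) and applies the lowercase and underscore rules only to the fallback name, which is one reading of the convention above.

```python
import re
from pathlib import Path


def dataset_slug(geo_accession=None, first_author="", year="", perturbation_type=""):
    """Build <dataset_name> following the naming convention above."""
    if geo_accession:
        # ASSUMPTION: accessions are kept as-is, e.g. GSE12345
        return geo_accession
    raw = f"{first_author}_{year}_{perturbation_type}".lower()
    # Collapse spaces and other non-alphanumeric runs into single underscores
    return re.sub(r"[^a-z0-9]+", "_", raw).strip("_")


def write_skill(skill_text, dataset_name, skills_root):
    """Write SKILL.md into <skills_root>/<dataset_name>/ and return its path."""
    skill_dir = Path(skills_root) / dataset_name
    skill_dir.mkdir(parents=True, exist_ok=True)
    out = skill_dir / "SKILL.md"
    out.write_text(skill_text, encoding="utf-8")
    return out
```

A call such as `write_skill(text, dataset_slug("GSE12345"), ".claude/skills/datasets")` would produce the path shape used in the index example.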

### Step 4: Register in Index
Append a row for the new dataset skill to the dataset index file. The index uses this format:
```markdown
# Processed Dataset Index

| Dataset | Type | Cells | Perturbations | Path |
|---------|------|-------|---------------|------|
| GSE12345 | chemical | 5000 | 10 compounds | .claude/skills/datasets/GSE12345/SKILL.md |
```
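Registering a dataset in this format can be sketched as an append-only helper. The function name `register_dataset` is hypothetical, and the index file location is passed in as a parameter because the text does not give it. Skipping a dataset that already has a row is a design choice of this sketch, not something the source specifies.

```python
from pathlib import Path

HEADER = (
    "# Processed Dataset Index\n\n"
    "| Dataset | Type | Cells | Perturbations | Path |\n"
    "|---------|------|-------|---------------|------|\n"
)


def register_dataset(index_path, dataset, ptype, n_cells, perturbations, skill_path):
    """Append one row to the index, creating the file with its header if needed."""
    index = Path(index_path)
    row = f"| {dataset} | {ptype} | {n_cells} | {perturbations} | {skill_path} |\n"
    if not index.exists():
        index.parent.mkdir(parents=True, exist_ok=True)
        index.write_text(HEADER, encoding="utf-8")
    existing = index.read_text(encoding="utf-8")
    # Skip datasets that already have a row, to avoid duplicate entries
    if any(line.startswith(f"| {dataset} |") for line in existing.splitlines()):
        return False
    if not existing.endswith("\n"):
        existing += "\n"
    index.write_text(existing + row, encoding="utf-8")
    return True
```

The return value tells the caller whether a new row was written, which makes repeated runs on the same dataset safe.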
