Archivo del skill

Dual-Disease Transcriptomic Machine Learning Research Planner

Name: Dual-Disease Transcriptomic Machine Learning Research Planner
Author: openclaw

Generates complete dual-disease transcriptomic + machine learning research designs from a user-provided disease pair. Use when users want to identify shared DEGs, common hub genes, cross-disease biomarkers, or shared molecular mechanisms between two diseases using public GEO data. Triggers: "shared biomarker study for two diseases", "dual-disease transcriptomic ML paper", "identify common DEGs between disease A and B", "cross-disease hub gene discovery", "shared DEG + PPI + ROC design", "immune infiltration shared biomarker", or "I want to study disease X and Y together". Always outputs four workload configurations (Lite / Standard / Advanced / Publication+) with a recommended primary plan, step-by-step workflow, figure plan, validation strategy, minimal executable version, and publication upgrade path.

openclaw4,189 estrellas14 mar 2026

Ocupación
Categorías: Bioinformática

Contenido de la habilidad

Generates a complete dual-disease transcriptomic + ML study design from a user-provided disease pair. Always outputs four workload configurations and a recommended primary plan.

Supported Study Styles

Style	Description	Example
A. Shared DEG → Hub Gene Core	DEG overlap → PPI → hub consensus	Intracranial aneurysm + AAA; diabetic + hypertensive nephropathy
B. Dual-Disease Shared Mechanism	Pathway-level convergence	ECM, inflammation, fibrosis linking two diseases
C. PPI + Multi-Algorithm Hub Prioritization	STRING + MCODE + CytoHubba consensus	Any pair with sufficient shared DEGs
D. Dual-Disease Biomarker Validation	ROC in discovery + validation cohorts	Any pair with ≥2 GEO datasets per disease
E. Immune Infiltration + Shared Biomarker	CIBERSORT/alternative + gene–immune correlation

Skills relacionados

Dual-Disease Transcriptomic Machine Learning Research Planner | Skills Pool

Archivo del skill

Dual-Disease Transcriptomic Machine Learning Research Planner

openclaw4,189 estrellas14 mar 2026

Ocupación
Categorías: Bioinformática

Contenido de la habilidad

Generates a complete dual-disease transcriptomic + ML study design from a user-provided disease pair. Always outputs four workload configurations and a recommended primary plan.

Supported Study Styles

Style	Description	Example
A. Shared DEG → Hub Gene Core	DEG overlap → PPI → hub consensus	Intracranial aneurysm + AAA; diabetic + hypertensive nephropathy
B. Dual-Disease Shared Mechanism	Pathway-level convergence	ECM, inflammation, fibrosis linking two diseases
C. PPI + Multi-Algorithm Hub Prioritization	STRING + MCODE + CytoHubba consensus	Any pair with sufficient shared DEGs
D. Dual-Disease Biomarker Validation	ROC in discovery + validation cohorts	Any pair with ≥2 GEO datasets per disease
E. Immune Infiltration + Shared Biomarker	CIBERSORT/alternative + gene–immune correlation

Skills relacionados

Config	Goal	Timeframe	Best For
Lite	Shared DEG + basic hub, 1 dataset per disease	2–4 weeks	Pilot, skeleton manuscript, single-dataset constraint
Standard	Full pipeline + validation + ROC + one deepening layer	5–9 weeks	Core publishable paper
Advanced	Standard + immune + GSEA + multi-cohort robustness	9–14 weeks	Competitive journal target
Publication+	Full multi-layer + experimental suggestions + reviewer defense	12–20 weeks	High-impact submission

EXAMPLE ID convention: All GEO accession numbers in code must carry an inline comment: # EXAMPLE ID — replace with your actual GSE accession before running

Zero-intersection guard: All pipelines must include a feasibility check immediately after DEG intersection:

if (length(shared_genes) == 0) {
  stop("No shared DEGs found. Recovery options: (1) relax logFC to 0.5, (2) use top-500 DEGs per disease, (3) switch to WGCNA co-expression module overlap.")
}

Standard package list: GEOquery, limma, clusterProfiler, org.Hs.eg.db, pROC, igraph, STRINGdb, WGCNA. Provide BiocManager::install() calls where needed.
GEO search pattern: To find valid accession IDs, use GEOquery::getGEO("GSEsearch", ...) or direct search at https://www.ncbi.nlm.nih.gov/geo/

library(GEOquery); library(limma); library(clusterProfiler); library(pROC)

# Load datasets — EXAMPLE IDs: replace before running
gse_disease1 <- getGEO("GSEXXXXX", GSEMatrix = TRUE)[[1]]  # EXAMPLE ID
gse_disease2 <- getGEO("GSEXXXXX", GSEMatrix = TRUE)[[1]]  # EXAMPLE ID

# DEG analysis (repeat for disease2)
design <- model.matrix(~ group, data = pData(gse_disease1))
fit    <- eBayes(lmFit(exprs(gse_disease1), design))
deg_d1 <- subset(topTable(fit, coef = 2, adjust = "BH", number = Inf),
                 abs(logFC) > 1 & adj.P.Val < 0.05)

# Shared DEG intersection with zero-guard
shared_genes <- intersect(rownames(deg_d1), rownames(deg_d2))
if (length(shared_genes) == 0) {
  stop("No shared DEGs found. Recovery: relax logFC to 0.5 or use top-500 DEGs per disease.")
}

# ROC for top hub gene — EXAMPLE: replace 'HUB_GENE' and labels/scores with real data
roc_obj <- roc(response = labels, predictor = expr_scores)
cat("AUC:", auc(roc_obj), "\n")
if (auc(roc_obj) < 0.70) warning("AUC below 0.70 threshold. Consider mini-signature approach.")

File	Content	Used In
references/tissue_and_tool_decisions.md	Tissue prioritization rules by disease class; immune deconvolution tool selection by tissue type	Step 4 (immune module), Step 1
references/geo_search_and_tools.md	GEO dataset search strategy by disease class; bioinformatics tool list with alternatives	Step 4 (dataset module)
references/figure_plan_template.md	Full figure list (Fig 1–8) and table templates (Table 1–4)	Step 5
references/upgrade_path.md	Publication upgrade impact vs complexity table	Step 9

Dual-Disease Transcriptomic Machine Learning Research Planner

Supported Study Styles

Dual-Disease Transcriptomic Machine Learning Research Planner

Supported Study Styles

Minimum User Input

Step-by-Step Execution

Step 1: Infer Study Type

Step 2: Output Four Configurations

Step 4: Full Step-by-Step Workflow

Step 5: Figure Plan

Step 6: Validation and Robustness Plan

Step 7: Risk Review

Step 8: Minimal Executable Version

Step 9: Publication Upgrade Path

R Code Framework Guidelines

Hard Rules

Input Validation

Reference Files

Nanoclaw Repl

Bioinformatics

Smart Explore

Vector Database Engineer

Skin Health Analyzer

Scanpy

Dual-Disease Transcriptomic Machine Learning Research Planner

Supported Study Styles

Dual-Disease Transcriptomic Machine Learning Research Planner

Supported Study Styles

Minimum User Input

Step-by-Step Execution

Step 1: Infer Study Type

Step 2: Output Four Configurations

Step 3: Recommend One Primary Plan

Step 4: Full Step-by-Step Workflow

Step 5: Figure Plan

Step 6: Validation and Robustness Plan

Step 7: Risk Review

Step 8: Minimal Executable Version

Step 9: Publication Upgrade Path

R Code Framework Guidelines

Hard Rules

Input Validation

Reference Files

Nanoclaw Repl

Bioinformatics

Smart Explore

Vector Database Engineer

Skin Health Analyzer

Scanpy