Name: Multi-Omics Integration
Author: Nam4802

Multi-Omics Integration

Integrate and analyze multiple omics datasets (transcriptomics, proteomics, epigenomics, genomics, metabolomics) for systems biology and precision medicine. Performs cross-omics correlation, multi-omics clustering (MOFA+, NMF), pathway-level integration, and sample matching. Coordinates ToolUniverse skills for expression data (RNA-seq), epigenomics (methylation, ChIP-seq), variants (SNVs, CNVs), protein interactions, and pathway enrichment. Use when analyzing multi-omics datasets, performing integrative analysis, discovering multi-omics biomarkers, studying disease mechanisms across molecular layers, or conducting systems biology research that requires coordinated analysis of transcriptome, genome, epigenome, proteome, and metabolome data.

Nam4802, 0 stars, 2026-03-26

Occupation
Category: Bioinformatics

Coordinate and integrate multiple omics datasets for comprehensive systems biology analysis. This skill orchestrates specialized ToolUniverse skills to perform cross-omics correlation, multi-omics clustering, pathway-level integration, and unified interpretation across molecular layers.

When to Use This Skill

Triggers:

User has multiple omics datasets (RNA-seq + proteomics, methylation + expression, etc.)
Requests for integrative multi-omics analysis
Cross-omics correlation queries (e.g., "How does methylation affect expression?")
Multi-omics biomarker discovery
Systems biology questions requiring multiple molecular layers
Precision medicine applications with multi-omics patient data
Questions about molecular mechanisms across omics types

Example Questions This Skill Solves:

"Integrate RNA-seq and proteomics data to find genes with concordant changes"
"How does promoter methylation correlate with gene expression?"
"Perform multi-omics clustering to identify patient subtypes"
"Which pathways are dysregulated across transcriptome, proteome, and metabolome?"

# Multi-Omics Integration Report

## Dataset Summary
- **Omics Types**: RNA-seq, Proteomics, Methylation, CNV
- **Common Samples**: 45 patients (30 disease, 15 control)
- **Features**: 15,000 genes, 5,000 proteins, 450K CpGs, 20K CNV regions

## Cross-Omics Correlation

### RNA-Protein Correlation
- **Overall correlation**: r = 0.52 (expected: 0.4-0.6)
- **Highly correlated**: 3,245 genes (45%)
- **Discordant genes**: 890 genes (post-transcriptional regulation)

### Methylation-Expression
- **Promoter methylation**: Anticorrelation r = -0.41
- **Epigenetically regulated genes**: 1,256 genes (p < 0.01)
- **Example**: BRCA1 promoter hypermethylation → 3-fold reduced expression

### CNV-Expression Dosage Effect
- **Genes with dosage effect**: 445 genes (r > 0.5, p < 0.01)
- **Example**: MYC amplification (3 copies) → 2.8-fold increased expression

## Multi-Omics Clustering

### MOFA+ Analysis
- **Factor 1** (25% variance): Cell cycle genes (RNA + protein)
- **Factor 2** (18% variance): Immune signature (RNA + methylation)
- **Factor 3** (15% variance): Metabolic reprogramming (RNA + metabolites)

### Patient Subtypes
- **Subtype 1** (n=18): High proliferation, MYC amplification
- **Subtype 2** (n=15): Immune-enriched, hypomethylation
- **Subtype 3** (n=12): Metabolic dysregulation, mitochondrial dysfunction

## Pathway Integration

### Top Dysregulated Pathways (Multi-Omics Score)
1. **Cell Cycle** (score: 8.5) - RNA (↑), Protein (↑), CNV (amplification)
2. **Immune Response** (score: 7.2) - RNA (↑), Methylation (hypo)
3. **Glycolysis** (score: 6.8) - RNA (↑), Metabolites (↑)

## Multi-Omics Biomarkers

### Classification Performance
- **AUC**: 0.92 ± 0.04 (5-fold CV)
- **Features**: 50 total (20 RNA, 15 protein, 10 methylation, 5 CNV)
- **Top biomarkers**:
  - MYC expression (RNA)
  - CDK1 protein abundance
  - BRCA1 promoter methylation
  - TP53 CNV status

## Biological Interpretation

The multi-omics analysis reveals three distinct disease subtypes driven by different molecular mechanisms:

1. **Proliferative subtype**: Characterized by MYC amplification driving coordinated upregulation of cell cycle genes at both RNA and protein levels.
2. **Immune subtype**: Hypomethylation of immune genes leading to increased expression and T-cell infiltration.
3. **Metabolic subtype**: Shift from oxidative phosphorylation to glycolysis, with concordant changes in enzyme expression and metabolite levels.

These subtypes may respond differently to targeted therapies.
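The RNA-protein correlation figures in a report like this one rest on per-gene correlations across matched samples. The following is a minimal sketch of that step, assuming two genes x samples pandas DataFrames with no missing values. The function name `per_gene_spearman` and the input layout are illustrative assumptions, not part of the skill itself.

```python
import numpy as np
import pandas as pd


def per_gene_spearman(rna: pd.DataFrame, protein: pd.DataFrame) -> pd.Series:
    """Spearman correlation per gene between two genes x samples matrices.

    Assumes both matrices are complete (no NaN) and share gene identifiers.
    """
    # Restrict to genes and samples present in both layers.
    genes = rna.index.intersection(protein.index)
    samples = rna.columns.intersection(protein.columns)

    # Spearman correlation is Pearson correlation computed on ranks.
    r = rna.loc[genes, samples].rank(axis=1)
    p = protein.loc[genes, samples].rank(axis=1)

    # Centre each gene's ranks, then normalise the cross-product.
    r = r.sub(r.mean(axis=1), axis=0)
    p = p.sub(p.mean(axis=1), axis=0)
    numerator = (r * p).sum(axis=1)
    denominator = np.sqrt((r ** 2).sum(axis=1) * (p ** 2).sum(axis=1))

    # Genes that are constant in either layer yield NaN (0 / 0).
    return (numerator / denominator).rename("spearman_r")
```

The same function applies to methylation vs expression or CNV vs expression once features have been mapped to a shared gene identifier. Genes with a strongly negative or near-zero value are candidates for the "discordant" category.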

Core Capabilities

| Capability | Description |
|---|---|
| Data Integration | Match samples across omics, handle missing data, normalize scales |
| Cross-Omics Correlation | Correlate features across molecular layers (gene expression vs protein, methylation vs expression) |
| Multi-Omics Clustering | MOFA+, NMF, joint clustering to identify omics-driven subtypes |
| Pathway Integration | Combine omics evidence at pathway level for unified biological interpretation |
| Biomarker Discovery | Identify multi-omics signatures with improved predictive power |
| Skill Coordination | Orchestrate RNA-seq, epigenomics, variant-analysis, protein-interactions, gene-enrichment skills |
| Visualization | Circos plots, integrated heatmaps, network visualizations |
| Reporting | Unified multi-omics reports with cross-layer insights |
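The Data Integration capability (match samples, handle missing data, normalize scales) can be sketched as follows. This is an illustration under stated assumptions, not the skill's implementation: each layer is a features x samples DataFrame, scaling is a per-feature z-score, and missing values are replaced by the feature mean. The name `match_and_scale` is hypothetical.

```python
import pandas as pd


def match_and_scale(layers: dict[str, pd.DataFrame]) -> dict[str, pd.DataFrame]:
    """Keep samples shared by every layer and z-score each feature.

    Assumes each DataFrame is features x samples with sample IDs as columns.
    """
    # Sample matching: intersect the sample IDs of all layers.
    common = None
    for df in layers.values():
        common = df.columns if common is None else common.intersection(df.columns)

    scaled = {}
    for name, df in layers.items():
        sub = df.loc[:, common].astype(float)
        mean = sub.mean(axis=1)
        std = sub.std(axis=1, ddof=0)
        z = sub.sub(mean, axis=0).div(std, axis=0)
        # After z-scoring, 0 is the feature mean, so filling NaN with 0
        # is mean imputation; constant features also end up as all zeros.
        scaled[name] = z.fillna(0.0)
    return scaled
```

Z-scoring puts layers with very different dynamic ranges (read counts, protein intensities, beta values) on a comparable scale before correlation or clustering. Mean imputation is the simplest choice and may not suit layers with many missing values, such as proteomics.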

ToolUniverse Skills Coordination

| Skill | Used For | Phase |
|---|---|---|
| `tooluniverse-rnaseq-deseq2` | Load and analyze RNA-seq data | Phase 1, 4 |
| `tooluniverse-epigenomics` | Methylation analysis, ChIP-seq peaks | Phase 1, 4 |
| `tooluniverse-variant-analysis` | CNV and SNV processing | Phase 1, 3, 4 |
| `tooluniverse-protein-interactions` | Protein network context | Phase 6 |
| `tooluniverse-gene-enrichment` | Pathway enrichment | Phase 6 |
| `tooluniverse-expression-data-retrieval` | Public omics data retrieval | Phase 1 |
| `tooluniverse-target-research` | Gene/protein annotation | Phase 3, 8 |
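For orchestration, the skill-to-phase mapping in this table can be held as a small lookup. The sketch below only restates the table; the names `SKILL_PHASES` and `skills_for_phase` are hypothetical and do not correspond to any ToolUniverse API.

```python
# Skill name -> workflow phases in which it is used (copied from the table).
SKILL_PHASES: dict[str, tuple[int, ...]] = {
    "tooluniverse-rnaseq-deseq2": (1, 4),
    "tooluniverse-epigenomics": (1, 4),
    "tooluniverse-variant-analysis": (1, 3, 4),
    "tooluniverse-protein-interactions": (6,),
    "tooluniverse-gene-enrichment": (6,),
    "tooluniverse-expression-data-retrieval": (1,),
    "tooluniverse-target-research": (3, 8),
}


def skills_for_phase(phase: int) -> list[str]:
    """Return the skills the table lists for a given phase, in table order."""
    return [skill for skill, phases in SKILL_PHASES.items() if phase in phases]
```

Phases 2, 5, and 7 return an empty list because the table assigns no external skill to them.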

Quantified Minimums

| Component | Requirement |
|---|---|
| Omics types | At least 2 omics datasets |
| Common samples | At least 10 samples across omics |
| Cross-correlation | Pearson/Spearman correlation computed |
| Clustering | At least one method (MOFA+, NMF, or SNF) |
| Pathway integration | Enrichment with multi-omics evidence scores |
| Report | Summary, correlations, clusters, pathways, biomarkers |
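The first two minimums can be checked before any analysis starts. A minimal sketch, assuming "at least 10 samples across omics" means at least 10 sample IDs shared by every layer; that reading, and the name `check_minimums`, are assumptions made for illustration.

```python
def check_minimums(sample_ids_by_omics: dict[str, set[str]]) -> list[str]:
    """Return a list of unmet minimums; an empty list means both checks pass."""
    problems = []

    # Minimum 1: at least 2 omics datasets.
    if len(sample_ids_by_omics) < 2:
        problems.append("need at least 2 omics datasets")

    # Minimum 2: at least 10 samples shared by all layers (assumed reading).
    id_sets = [set(ids) for ids in sample_ids_by_omics.values()]
    common = set.intersection(*id_sets) if id_sets else set()
    if len(common) < 10:
        problems.append(f"only {len(common)} common samples, need at least 10")

    return problems
```

Running this check after sample matching makes a failed integration easy to diagnose: the message states which requirement was missed rather than letting a downstream correlation or clustering step fail on too little data.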

- Multi-Omics Integration
  - When to Use This Skill
  - Core Capabilities
  - Workflow Overview
  - Phase Details
    - Phase 1: Data Loading & Quality Control
    - Phase 2: Sample Matching
    - Phase 3: Feature Mapping
    - Phase 4: Cross-Omics Correlation
      - 4.1: Expression vs Protein (Translation Efficiency)
      - 4.2: Methylation vs Expression (Epigenetic Regulation)
      - 4.3: CNV vs Expression (Dosage Effect)
    - Phase 5: Multi-Omics Clustering
    - Phase 6: Pathway-Level Integration
    - Phase 7: Biomarker Discovery
    - Phase 8: Integrated Reporting
  - ToolUniverse Skills Coordination
  - Example Use Cases
    - Use Case 1: Cancer Multi-Omics
    - Use Case 2: eQTL + Expression
    - Use Case 3: Drug Response Multi-Omics
  - Advanced Analysis Patterns
    - Pattern 1: Omics-Driven Patient Stratification
    - Pattern 2: Multi-Omics Network Analysis
    - Pattern 3: Temporal Multi-Omics
    - Pattern 4: Spatial Multi-Omics
  - Quantified Minimums
  - Limitations
  - References
