Name: Multi-Omics Integration
Author: mims-harvard

Multi-Omics Integration

Integrate and analyze multiple omics datasets (transcriptomics, proteomics, epigenomics, genomics, metabolomics) for systems biology and precision medicine. Performs cross-omics correlation, multi-omics clustering (MOFA+, NMF), pathway-level integration, and sample matching. Coordinates ToolUniverse skills for expression data (RNA-seq), epigenomics (methylation, ChIP-seq), variants (SNVs, CNVs), protein interactions, and pathway enrichment. Use when analyzing multi-omics datasets, performing integrative analysis, discovering multi-omics biomarkers, studying disease mechanisms across molecular layers, or conducting systems biology research that requires coordinated analysis of transcriptome, genome, epigenome, proteome, and metabolome data.

mims-harvard1,271 Sterne29.03.2026

Beruf
Kategorien: Bioinformatik

Coordinate and integrate multiple omics datasets for comprehensive systems biology analysis. Orchestrates specialized ToolUniverse skills to perform cross-omics correlation, multi-omics clustering, pathway-level integration, and unified interpretation.

Domain Reasoning

Multi-omics integration asks whether different molecular layers tell a concordant story. If a gene is upregulated in RNA-seq AND its protein is elevated in proteomics, that is concordant evidence of true biological change. Discordance — high mRNA but low protein, or elevated protein without matching mRNA — may indicate post-transcriptional regulation (miRNA silencing, protein degradation, translational control) and is itself a meaningful finding worth reporting. Not every discordance is noise; some are the most interesting biology.

LOOK UP DON'T GUESS

Expected RNA-protein correlation ranges: compute Spearman r from the actual data; the typical range (0.4-0.6) is a guide, not a guarantee.
Pathway enrichment results: run ReactomeAnalysis_pathway_enrichment or gseapy on the actual gene lists; never list enriched pathways from memory.
eQTL associations: query GTEx or eQTL databases for the specific variant and tissue; do not assume regulatory relationships.

Multi-Omics Integration

mims-harvard1,271 Sterne29.03.2026

Beruf
Kategorien: Bioinformatik

Domain Reasoning

LOOK UP DON'T GUESS

Expected RNA-protein correlation ranges: compute Spearman r from the actual data; the typical range (0.4-0.6) is a guide, not a guarantee.

Pathway enrichment results: run ReactomeAnalysis_pathway_enrichment or gseapy on the actual gene lists; never list enriched pathways from memory.

eQTL associations: query GTEx or eQTL databases for the specific variant and tissue; do not assume regulatory relationships.

Omics	Formats	QC Focus
Transcriptomics	CSV/TSV, HDF5, h5ad	Low-count filter, normalize (TPM/DESeq2), log-transform
Proteomics	MaxQuant, Spectronaut, DIA-NN	Missing value imputation, median/quantile normalization
Methylation	IDAT, beta matrices	Failed probes, batch correction, cross-reactive filter
Genomics	VCF, SEG (CNV)	Variant QC, CNV segmentation
Metabolomics	Peak tables	Missing values, normalization

Method	Description	Best For
MOFA+	Latent factors explaining cross-omics variation	Identifying shared/omics-specific drivers
Joint NMF	Shared decomposition across omics	Patient subtype discovery
SNF	Similarity network fusion	Integrating heterogeneous data types

Skill	Used For	Phase
`tooluniverse-rnaseq-deseq2`	RNA-seq analysis	1, 4
`tooluniverse-epigenomics`	Methylation, ChIP-seq	1, 4
`tooluniverse-variant-analysis`	CNV/SNV processing	1, 3, 4
`tooluniverse-protein-interactions`	Protein network context	6
`tooluniverse-gene-enrichment`	Pathway enrichment	6
`tooluniverse-expression-data-retrieval`	Public data retrieval	1
`tooluniverse-target-research`	Gene/protein annotation	3, 8

Component	Requirement
Omics types	At least 2 datasets
Common samples	At least 10 across omics
Cross-correlation	Pearson/Spearman computed
Clustering	At least one method (MOFA+, NMF, or SNF)
Pathway integration	Enrichment with multi-omics evidence scores
Report	Summary, correlations, clusters, pathways, biomarkers

Multi-Omics Integration

Domain Reasoning

LOOK UP DON'T GUESS

Multi-Omics Integration

Domain Reasoning

LOOK UP DON'T GUESS

When to Use This Skill

Workflow Overview

Supported Data Types

Core Operations

Sample Matching

Cross-Omics Correlation

Pathway Integration

Multi-Omics Clustering Methods

ToolUniverse Skills Coordination

Use Cases

Cancer Multi-Omics

eQTL + Expression + Methylation

Drug Response Multi-Omics

Quantified Minimums

Limitations

References

Detailed Reference

Nanoclaw Repl

Bioinformatics

Smart Explore

Vector Database Engineer

Skin Health Analyzer

Scanpy