Integrate statistical analysis results with biological knowledge from ToolUniverse tools. After computing associations or differential expression, use pathway analysis, literature search, drug-target databases, and variant annotation to interpret findings biologically. Use when statistical results need biological context, when users want to go beyond p-values to understand mechanisms, or when combining data analysis with literature evidence.
When analysis requires computation (statistics, data processing, scoring, enrichment), write and run Python code via Bash. Don't describe what you would do -- execute it and report actual results. Use ToolUniverse tools to retrieve data, then Python (pandas, scipy, statsmodels, matplotlib) to analyze it.
Bridge the gap between statistical results and biological understanding. After any computational analysis produces significant findings, this skill teaches how to interpret them using ToolUniverse's biological knowledge tools -- the key advantage over platforms that only do data analysis.
IMPORTANT: Always use English terms in tool calls (gene names, pathway names, organism names), even if the user writes in another language. Respond in the user's language.
Apply when:
NOT for (use other skills instead):
tooluniverse-statistical-modeling or tooluniverse-rnaseq-deseq2tooluniverse-gene-enrichmenttooluniverse-literature-deep-researchtooluniverse-variant-interpretationMap each type of significant finding to the right biological question:
| Finding Type | Biological Question | Tool Discovery Query |
|---|---|---|
| Significant gene list | What pathways are enriched? What functions converge? | find_tools("gene enrichment pathway analysis") |
| Significant variant (rsID) | What is the functional impact? Which gene is affected? | find_tools("variant annotation functional impact") |
| Significant exposure/chemical | What is the biological mechanism? Which pathways? | find_tools("chemical gene pathway toxicology") |
| Significant drug association | What is the molecular target? What is the MOA? | find_tools("drug target mechanism action") |
| Significant metabolite | Which metabolic pathway is perturbed? | find_tools("metabolite pathway identification") |
Key principle: Do not stop at "gene X is significant." Ask: significant in what context? Through what mechanism? With what downstream consequence?
For each significant finding, query multiple sources and synthesize. The pattern:
Evidence grading (grade each piece of evidence):
| Grade | Source Type | Example |
|---|---|---|
| T1 (Strong) | Randomized clinical trial, Mendelian randomization | "RCT showed drug X reduces outcome Y" |
| T2 (Moderate) | Large cohort study, GWAS with replication | "GWAS meta-analysis in 500k subjects" |
| T3 (Suggestive) | Case-control study, animal model | "Mouse knockout shows phenotype" |
| T4 (Hypothesis) | In silico prediction, pathway inference | "Network analysis suggests involvement" |
Statistical association is not causation. Apply these reasoning frameworks:
DAG construction: Before interpreting, sketch the causal directed acyclic graph (DAG).
Triangulation: The same finding supported by different methods with different biases strengthens causal inference.
Mendelian randomization logic: Genetic variants (instruments) are assigned at conception, so they are not confounded by lifestyle or reverse causation. If a genetic variant that increases exposure X also increases disease Y, this supports X causing Y. Check instrument strength (F-statistic > 10), exclusion restriction (variant affects Y only through X), and pleiotropy (MR-Egger intercept).
Mediation analysis: If gene G is associated with both exposure and outcome, ask: does the exposure effect on outcome go through G? Use the finding's pathway context (Step 2) to propose mediators, then check if adjusting for the mediator attenuates the effect.
Before reporting a finding as robust, attempt to falsify it:
Structure the integrated report as follows:
For each significant finding, produce one row:
| Finding | Statistical Evidence | Biological Mechanism | Literature Support | Genetic Support | Evidence Grade |
|---|---|---|---|---|---|
| Gene X upregulated | FDR=0.001, log2FC=2.3 | PI3K/AKT pathway | 12 papers, 2 RCTs | GWAS: rs123 (p=5e-8) | Strong |
| Variant rs456 | OR=1.4, p=2e-6 | Splicing disruption | 3 case reports | eQTL in GTEx | Moderate |