GWAS Study Deep Dive & Meta-Analysis | Skills Pool
Skill ファイル
GWAS Study Deep Dive & Meta-Analysis
Compare GWAS studies, perform meta-analyses, and assess replication across cohorts. Integrates NHGRI-EBI GWAS Catalog and Open Targets Genetics to compare study designs, effect sizes, ancestry diversity, and heterogeneity statistics. Use when comparing GWAS studies for a trait, performing meta-analysis of genetic loci, assessing replication across cohorts, or exploring the genetic architecture of complex diseases.
mims-harvard1,271 スター2026/03/29
職業
カテゴリ
バイオインフォマティクス
スキル内容
Compare GWAS studies, perform meta-analyses, and assess replication across cohorts
Overview
The GWAS Study Deep Dive & Meta-Analysis skill enables comprehensive comparison of genome-wide association studies (GWAS) for the same trait, meta-analysis of genetic loci across studies, and systematic assessment of replication and study quality. It integrates data from the NHGRI-EBI GWAS Catalog and Open Targets Genetics to provide a complete picture of the genetic architecture of complex traits.
Key Capabilities
Study Comparison: Compare all GWAS studies for a trait, assessing sample sizes, ancestries, and platforms
Meta-Analysis: Aggregate effect sizes across studies and calculate heterogeneity statistics
Replication Assessment: Identify replicated vs novel findings across discovery and replication cohorts
Quality Evaluation: Assess statistical power, ancestry diversity, and data availability
COMPUTE, DON'T DESCRIBE
関連 Skill
When analysis requires computation (statistics, data processing, scoring, enrichment), write and run Python code via Bash. Don't describe what you would do — execute it and report actual results. Use ToolUniverse tools to retrieve data, then Python (pandas, scipy, statsmodels, matplotlib) to analyze it.
Domain Reasoning: Comparing Studies for the Same Trait
When comparing GWAS studies for the same trait, ask: do they replicate? The same lead SNPs appearing in independent studies is strong evidence of a true association. Different lead SNPs at the same locus may reflect LD differences between populations — they may tag the same causal variant. Different loci entirely may reflect different study designs, phenotype definitions, or population ancestry. Before concluding that a finding failed to replicate, check whether the SNP was even genotyped or imputed in the replication cohort.
LOOK UP DON'T GUESS: effect sizes, p-values, allele frequencies, and LD structure for specific loci. Do not assume a SNP present in one study is present in another — use gwas_get_associations_for_snp to retrieve cross-study data. Do not infer LD blocks from genomic proximity; use credible sets from Open Targets for fine-mapping results.
Use Cases
1. Comprehensive Trait Analysis
Scenario: "I want to understand all available GWAS data for type 2 diabetes"
Workflow:
Search for all T2D studies in GWAS Catalog
Filter by sample size and ancestry
Extract top associations from each study
Identify consistently replicated loci
Assess ancestry-specific effects
Outcome: Complete landscape of T2D genetics with replicated findings and population-specific signals
2. Locus-Specific Meta-Analysis
Scenario: "Is the TCF7L2 association with T2D consistent across all studies?"
Workflow:
Retrieve all TCF7L2 (rs7903146) associations for T2D
Calculate combined effect size and p-value
Assess heterogeneity (I² statistic)
Generate forest plot data
Interpret heterogeneity level
Outcome: Quantitative assessment of effect size consistency with heterogeneity interpretation
3. Replication Analysis
Scenario: "Which findings from the discovery cohort replicated in the independent sample?"
Workflow:
Get top hits from discovery study
Check for presence and significance in replication study
Assess direction consistency
Calculate replication rate
Identify novel vs failed replication
Outcome: Systematic replication report with success rates and failed findings
4. Multi-Ancestry Comparison
Scenario: "Are T2D loci consistent across European and East Asian populations?"
Workflow:
Filter studies by ancestry
Compare top associations between populations
Identify shared vs population-specific loci
Assess allele frequency differences
Evaluate transferability of genetic risk scores
Outcome: Ancestry-specific genetic architecture with transferability assessment
Statistical Methods
Meta-Analysis Approach
This skill implements standard GWAS meta-analysis methods:
Fixed-Effects Model:
Used when heterogeneity is low (I² < 25%)
Weights studies by inverse variance
Assumes true effect size is the same across studies
Random-Effects Model (recommended when I² > 50%):
Accounts for between-study variation
More conservative than fixed-effects
Better for diverse ancestries or methodologies
Heterogeneity Assessment:
The I² statistic measures the percentage of variance due to between-study heterogeneity: