Name: Structural Variant Analysis Workflow
Author: Nam4802

Structural Variant Analysis Workflow

Comprehensive structural variant (SV) analysis skill for clinical genomics. Classifies SVs (deletions, duplications, inversions, translocations), assesses pathogenicity using ACMG-adapted criteria, evaluates gene disruption and dosage sensitivity, and provides clinical interpretation with evidence grading. Use when analyzing CNVs, large deletions/duplications, chromosomal rearrangements, or any structural variants requiring clinical interpretation.

Nam48020 estrellas26 mar 2026

Ocupación
Categorías: Bioinformática

Systematic analysis of structural variants (deletions, duplications, inversions, translocations, complex rearrangements) for clinical genomics interpretation using ACMG-adapted criteria.

KEY PRINCIPLES:

Report-first approach - Create SV_analysis_report.md FIRST, then populate progressively
ACMG-style classification - Pathogenic/Likely Pathogenic/VUS/Likely Benign/Benign with explicit evidence
Evidence grading - Grade all findings by confidence level (★★★/★★☆/★☆☆)
Dosage sensitivity critical - Gene dosage effects drive SV pathogenicity
Breakpoint precision matters - Exact gene disruption vs dosage-only effects
Population context essential - gnomAD SVs for frequency assessment
English-first queries - Always use English terms in tool calls (gene names, disease names), even if the user writes in another language. Only try original-language terms as a fallback. Respond in the user's language

Problem This Skill Solves

Structural variants (SVs) present unique interpretation challenges:

Systematic analysis of structural variants (deletions, duplications, inversions, translocations, complex rearrangements) for clinical genomics interpretation using ACMG-adapted criteria.

KEY PRINCIPLES:

Report-first approach - Create SV_analysis_report.md FIRST, then populate progressively
ACMG-style classification - Pathogenic/Likely Pathogenic/VUS/Likely Benign/Benign with explicit evidence
Evidence grading - Grade all findings by confidence level (★★★/★★☆/★☆☆)
Dosage sensitivity critical - Gene dosage effects drive SV pathogenicity
Breakpoint precision matters - Exact gene disruption vs dosage-only effects
Population context essential - gnomAD SVs for frequency assessment
English-first queries - Always use English terms in tool calls (gene names, disease names), even if the user writes in another language. Only try original-language terms as a fallback. Respond in the user's language

Problem This Skill Solves

Structural variants (SVs) present unique interpretation challenges:

┌─────────────────────────────────────────────────────────────────┐ │ STRUCTURAL VARIANT INTERPRETATION │ ├─────────────────────────────────────────────────────────────────┤ │ │ │ Phase 1: SV IDENTITY & CLASSIFICATION │ │ ├── Normalize SV coordinates (hg19/hg38) │ │ ├── Determine SV type (DEL/DUP/INV/TRA/CPX) │ │ ├── Calculate SV size │ │ └── Assess breakpoint precision │ │ │ │ Phase 2: GENE CONTENT ANALYSIS │ │ ├── Identify genes fully contained in SV │ │ ├── Identify genes with breakpoints (disrupted) │ │ ├── Annotate gene function and disease associations │ │ ├── Identify regulatory elements affected │ │ └── Assess gene orientation (for inversions/translocations) │ │ │ │ Phase 3: DOSAGE SENSITIVITY ASSESSMENT │ │ ├── ClinGen dosage sensitivity scores │ │ │ └─ Haploinsufficiency / Triplosensitivity ratings │ │ ├── DECIPHER haploinsufficiency predictions │ │ ├── pLI scores (gnomAD) for loss-of-function intolerance │ │ ├── OMIM gene-disease associations (dominant/recessive) │ │ └── Known dosage-sensitive genes from literature │ │ │ │ Phase 4: POPULATION FREQUENCY CONTEXT │ │ ├── gnomAD SV database (overlapping SVs) │ │ ├── DGV (Database of Genomic Variants) │ │ ├── ClinVar (known pathogenic/benign SVs) │ │ └── Calculate reciprocal overlap with population SVs │ │ │ │ Phase 5: PATHOGENICITY SCORING │ │ ├── Pathogenicity score (0-10 scale) │ │ │ ├─ Gene content weight (40%) │ │ │ ├─ Dosage sensitivity weight (30%) │ │ │ ├─ Population frequency weight (20%) │ │ │ └─ Inheritance/phenotype match weight (10%) │ │ ├── Apply ACMG SV criteria │ │ └── Generate classification recommendation │ │ │ │ Phase 6: LITERATURE & CLINICAL EVIDENCE │ │ ├── PubMed: Similar SVs, gene disruption studies │ │ ├── DECIPHER: Developmental disorder cases │ │ ├── Clinical case reports │ │ └── Functional evidence for gene dosage effects │ │ │ │ Phase 7: ACMG-ADAPTED CLASSIFICATION │ │ ├── Apply SV-specific evidence codes │ │ ├── Calculate final classification │ │ ├── Identify limiting factors │ │ └── Generate clinical recommendations │ │ │ └─────────────────────────────────────────────────────────────────┘

def assess_dosage_sensitivity(tu, gene_list): """ Assess dosage sensitivity for all genes in SV. Returns dosage scores and interpretation. """ dosage_data = [] for gene_symbol in gene_list: # 1. ClinGen dosage sensitivity (gold standard) clingen = tu.tools.ClinGen_search_dosage_sensitivity( gene=gene_symbol ) hi_score = None ts_score = None if clingen.get('data'): for entry in clingen['data']: hi_score = entry.get('Haploinsufficiency Score') ts_score = entry.get('Triplosensitivity Score') break # 2. ClinGen gene validity (supports dosage sensitivity) validity = tu.tools.ClinGen_search_gene_validity( gene=gene_symbol ) validity_level = None if validity.get('data'): for entry in validity['data']: validity_level = entry.get('Classification') break # 3. pLI score from gnomAD (if available via gene search) # Note: May need to use myvariant or other tools # pli_score = get_pli_score(tu, gene_symbol) # 4. OMIM inheritance pattern omim = tu.tools.OMIM_search( operation="search", query=gene_symbol, limit=3 ) inheritance_pattern = None if omim.get('data', {}).get('entries'): for entry in omim['data']['entries']: mim = entry.get('mimNumber') details = tu.tools.OMIM_get_entry( operation="get_entry", mim_number=str(mim) ) # Extract inheritance from details # inheritance_pattern = parse_inheritance(details) # Integrate evidence dosage_assessment = { 'gene': gene_symbol, 'hi_score': hi_score, 'ts_score': ts_score, 'validity_level': validity_level, 'inheritance': inheritance_pattern, 'is_dosage_sensitive': (hi_score == '3' or ts_score == '3'), 'evidence_grade': calculate_evidence_grade(hi_score, ts_score, validity_level) } dosage_data.append(dosage_assessment) return dosage_data def calculate_evidence_grade(hi_score, ts_score, validity): """ Calculate evidence grade for dosage sensitivity. """ if (hi_score == '3' or ts_score == '3') and validity == 'Definitive': return '★★★' # High confidence elif (hi_score in ['2', '3'] or ts_score in ['2', '3']): return '★★☆' # Moderate confidence else: return '★☆☆' # Low confidence

Type	Abbreviation	Description	Molecular Effect
Deletion	DEL	Loss of genomic segment	Haploinsufficiency, gene disruption
Duplication	DUP	Gain of genomic segment	Triplosensitivity, gene dosage imbalance
Inversion	INV	Segment flipped in orientation	Gene disruption at breakpoints, position effects
Translocation	TRA	Segment moved to different chromosome	Gene fusions, disruption, position effects
Complex	CPX	Multiple rearrangement types	Variable effects

Tool	Purpose	Key Data
`Ensembl_lookup_gene`	Gene structure, coordinates	Gene boundaries, exons, transcripts
`NCBI_gene_search`	Gene information	Official symbol, aliases, description
`Gene_Ontology_get_term_info`	Gene function	Biological process, molecular function
`OMIM_search`, `OMIM_get_entry`	Disease associations	Inheritance, clinical features
`DisGeNET_search_gene`	Gene-disease associations	Evidence scores

Tool	Purpose	Key Data
`ClinGen_search_dosage_sensitivity`	Gold standard curation	HI/TS scores (0-3)
`ClinGen_search_gene_validity`	Gene-disease validity	Definitive/Strong/Moderate
`gnomad_search` (pLI)	Loss-of-function intolerance	pLI score (0-1)
`DECIPHER_search`	Developmental disorders	Patient phenotypes with similar SVs
`OMIM_get_entry`	Inheritance pattern	AD/AR indicates dosage sensitivity

Score	Haploinsufficiency (HI)	Triplosensitivity (TS)	Interpretation
3	Sufficient evidence	Sufficient evidence	Gene IS dosage-sensitive
2	Emerging evidence	Emerging evidence	Likely dosage-sensitive
1	Little evidence	Little evidence	Insufficient evidence
0	No evidence	No evidence	No established dosage sensitivity

pLI Range	Interpretation	LoF Intolerance
≥0.9	Extremely intolerant	High - likely haploinsufficient
0.5-0.9	Moderately intolerant	Moderate
<0.5	Tolerant	Low - likely NOT haploinsufficient

Structural Variant Analysis Workflow

Problem This Skill Solves

Structural Variant Analysis Workflow

Problem This Skill Solves

Triggers

Workflow Overview

Phase Details

Phase 1: SV Identity & Classification

Phase 2: Gene Content Analysis

Phase 3: Dosage Sensitivity Assessment

Phase 4: Population Frequency Context

Nanoclaw Repl

Bioinformatics

Smart Explore

Vector Database Engineer

Skin Health Analyzer

Scanpy

Tool	Purpose	Key Data
`gnomad_search`	Population SV frequencies	Overlapping SVs, frequencies
`ClinVar_search_variants`	Known pathogenic/benign SVs	Classification, review status
`DECIPHER_search`	Patient SVs with phenotypes	Case reports, phenotype similarity

SV Frequency	ACMG Code	Interpretation
≥1% in gnomAD SVs	BA1 (Stand-alone Benign)	Too common for rare disease
0.1-1%	BS1 (Strong Benign)	Likely benign common variant
<0.01%	PM2 (Supporting Pathogenic)	Rare, supports pathogenicity
Absent	PM2 (Supporting)	Very rare, supports pathogenicity