Analyze HLA genes, MHC binding, epitope-MHC associations, and immunogenomics for transplant compatibility, vaccine design, and immunotherapy. Integrates IMGT, IEDB, BVBRC, UniProt, and DGIdb. Use for HLA typing interpretation, antigen presentation analysis, MHC restriction, neoantigen prediction context, and transplant immunology.
Pipeline for exploring HLA gene families, MHC-peptide binding, epitope associations, and their clinical implications in transplantation, vaccine development, and cancer immunotherapy. Bridges immunogenetic databases (IMGT, IEDB) with functional annotation (UniProt) and druggability data (DGIdb).
HLA analysis is fundamentally about peptide presentation: the polymorphism of HLA molecules determines which peptides are displayed to T cells, which in turn governs disease susceptibility, transplant rejection, drug hypersensitivity, and vaccine immunogenicity. HLA type affects disease susceptibility for autoimmune conditions (HLA-B27 and ankylosing spondylitis), transplant rejection (HLA mismatch drives alloresponse), drug hypersensitivity (abacavir causes severe hypersensitivity reactions only in HLA-B*57:01 carriers), and vaccine design (epitopes must be presented by the recipient's HLA alleles to elicit a T-cell response). Class I and Class II HLA molecules have fundamentally different binding grooves, peptide lengths, and T-cell partners — never conflate them. The absence of an epitope from IEDB means it has not been tested, not that it cannot bind.
LOOK UP DON'T GUESS: Never assume an allele's binding properties or population frequency — query IEDB for experimental binding data and IMGT for allele annotation. Do not guess which HLA alleles are common in a population; look up published frequency data via PubMed.
Guiding principles:
When analysis requires computation (statistics, data processing, scoring, enrichment), write and run Python code via Bash. Don't describe what you would do — execute it and report actual results. Use ToolUniverse tools to retrieve data, then Python (pandas, scipy, statsmodels, matplotlib) to analyze it.
Typical triggers:
Not this skill: For full neoantigen prediction pipelines, use tooluniverse-immunotherapy-response-prediction. For general gene function lookup, use tooluniverse-drug-target-validation.
| Database | Scope | Best For |
|---|---|---|
| IMGT | International ImMunoGeneTics; HLA/MHC gene nomenclature and sequences | Authoritative HLA gene info, allele nomenclature, sequence data |
| IEDB | Immune Epitope Database; experimentally validated epitope-MHC data | Epitope binding, MHC restriction, T-cell assay results |
| BVBRC | BV-BRC (formerly PATRIC/IRD); pathogen epitopes | Pathogen-derived epitopes with host MHC context |
| UniProt | Protein function and structure annotations | HLA protein features, domains, variants |
| DGIdb | Drug-Gene Interaction Database | Druggability of HLA-pathway genes |
| PubMed | Biomedical literature | Clinical HLA studies, transplant outcomes |
Phase 0: Query Parsing & HLA Disambiguation
Resolve allele names, identify MHC class, confirm species
|
Phase 1: HLA Gene Lookup
IMGT gene info, allele details, sequence data
|
Phase 2: MHC Binding & Restriction
IEDB MHC binding data, allele-specific peptide repertoire
|
Phase 3: Epitope-MHC Associations
IEDB/BVBRC epitope search, pathogen-specific epitopes
|
Phase 4: Functional Annotation
UniProt protein features, structural domains
|
Phase 5: Clinical & Therapeutic Context
DGIdb druggability, PubMed clinical evidence
|
Phase 6: Report Synthesis
Integrated immunogenomics report
Parse the user's input to identify:
HLA nomenclature quick reference:
HLA-A*02:01 = gene A, allele group 02, specific protein 01Objective: Get authoritative gene and allele information from IMGT.
Tools:
IMGT_search_genes -- search for HLA/MHC genes
query (gene name or keyword), optional species, locusIMGT_get_gene_info -- get detailed gene/allele information
gene_name (IMGT gene name)Workflow:
If allele not found: Check nomenclature -- older names may have been reassigned. Try searching by the gene name alone (e.g., "HLA-A") and filtering results.
Objective: Find what peptides bind to a specific MHC molecule, or what MHC molecules present a given peptide.
Tools:
iedb_search_mhc -- search for MHC molecules in IEDB
mhc_restriction (allele name), optional mhc_classiedb_get_epitope_mhc -- get MHC binding details for an epitope
epitope_id (IEDB epitope ID)Workflow:
Binding affinity interpretation (Class I):
Objective: Find epitopes from specific pathogens or antigens and their MHC restriction.
Tools:
iedb_search_epitopes -- search for experimentally validated epitopes
organism_name (source organism), source_antigen_name (protein name)BVBRC_search_epitopes -- search pathogen-derived epitopes
query (pathogen or antigen keyword), optional host, limitWorkflow:
Important: IEDB epitopes are experimentally validated, not predicted. The absence of an epitope does not mean it won't bind -- it may simply be untested.
Population coverage for vaccine design: When selecting epitopes for a vaccine, check how common the restricting HLA allele is in the target population. An epitope restricted to HLA-A*02:01 covers ~50% of Europeans but <15% of some African populations. For broad population coverage, select epitopes across multiple HLA supertypes (A2, A3, B7, B44 cover >95% of most populations).
Objective: Get protein-level features for HLA molecules and related proteins.
Tools:
UniProt_search -- search for HLA protein entries
query (protein/gene name), optional organism, limitWorkflow:
Objective: Connect HLA findings to drug interactions and clinical evidence.
Tools:
DGIdb_get_drug_gene_interactions -- find drugs targeting HLA-pathway genes
genes (list of gene names, e.g., ["HLA-A", "B2M"])PubMed_search_articles -- find clinical HLA studies
query (search term), optional limitWorkflow:
Well-known HLA-drug associations (for context, always verify with current data):
Structure the report as: