Comprehensive antibody engineering and optimization for therapeutic development. Covers humanization, affinity maturation, developability assessment, and immunogenicity prediction. Use when asked to optimize antibodies, humanize sequences, or engineer therapeutic antibodies from lead to clinical candidate.
AI-guided antibody optimization pipeline from preclinical lead to clinical candidate. Covers sequence humanization, structure modeling, affinity optimization, developability assessment, immunogenicity prediction, and manufacturing feasibility.
KEY PRINCIPLES:
When uncertain about any scientific fact, SEARCH databases first (PubMed, UniProt, ChEMBL, ClinVar, etc.) rather than reasoning from memory. A database-verified answer is always more reliable than a guess.
Apply when user asks:
antibody_optimization_report.mdoptimized_sequences.fasta - All optimized variantshumanization_comparison.csv - Before/after comparisondevelopability_assessment.csv - Detailed scoresSee REPORT_TEMPLATE.md for the full report template with section formats.
Every optimization MUST include per-variant documentation with:
| Tool | Purpose | Category |
|---|---|---|
IMGT_search_genes | Germline gene identification | Humanization |
IMGT_get_sequence | Human framework sequences | Humanization |
SAbDab_search_structures | Antibody structure precedents | Structure |
TheraSAbDab_search_by_target | Clinical antibody benchmarks | Validation |
alphafold_get_prediction | Structure modeling | Structure |
iedb_search_epitopes | Epitope identification | Immunogenicity |
iedb_search_bcell | B-cell epitope prediction | Immunogenicity |
UniProt_get_entry_by_accession | Target antigen information | Target |
STRING_get_interaction_partners | Protein interaction network | Bispecifics |
PubMed_search_articles | Literature precedents | Validation |
CRITICAL: SOAP tools (IMGT, SAbDab, TheraSAbDab) require an operation parameter. See QUICK_START.md for correct usage.
Phase 1: Input Analysis & Characterization
├── Sequence annotation (CDRs, framework)
├── Species identification
├── Target antigen identification
├── Clinical precedent search
└── OUTPUT: Input characterization
↓
Phase 2: Humanization Strategy
├── Germline gene alignment (IMGT)
├── Framework selection
├── CDR grafting design
├── Backmutation identification
└── OUTPUT: Humanization plan
↓
Phase 3: Structure Modeling & Analysis
├── AlphaFold prediction
├── CDR conformation analysis
├── Epitope mapping
├── Interface analysis
└── OUTPUT: Structural assessment
↓
Phase 4: Affinity Optimization
├── In silico mutation screening
├── CDR optimization strategies
├── Interface improvement
└── OUTPUT: Affinity variants
↓
Phase 5: Developability Assessment
├── Aggregation propensity
├── PTM site identification
├── Stability prediction
├── Expression prediction
└── OUTPUT: Developability score
↓
Phase 6: Immunogenicity Prediction
├── MHC-II epitope prediction (IEDB)
├── T-cell epitope risk
├── Aggregation-related immunogenicity
└── OUTPUT: Immunogenicity risk score
↓
Phase 7: Manufacturing Feasibility
├── Expression level prediction
├── Purification considerations
├── Formulation stability
└── OUTPUT: Manufacturing assessment
↓
Phase 8: Final Report & Recommendations
├── Ranked variant list
├── Experimental validation plan
├── Next steps
└── OUTPUT: Comprehensive report
Goal: Annotate sequences, identify species/germline, find clinical precedents.
Key steps:
IMGT_search_genesTheraSAbDab_search_by_targetUniProt_get_entry_by_accessionOutput: Sequence information table, CDR annotation, target info, clinical precedent list.
See WORKFLOW_DETAILS.md Phase 1 for code examples.
Goal: Select human framework, design CDR grafting, identify backmutations.
Key steps:
Output: Framework selection rationale, grafting design, backmutation analysis, humanized sequences.
See WORKFLOW_DETAILS.md Phase 2 for code examples.
Goal: Predict structure, analyze CDR conformations, map epitope.
Key steps:
alphafold_get_prediction (VH:VL)iedb_search_epitopesSAbDab_search_structuresOutput: Structure quality table, CDR conformation analysis, epitope mapping, structural comparison.
See WORKFLOW_DETAILS.md Phase 3 for code examples.
Goal: Design affinity-improving mutations via computational screening.
Key steps:
Output: Ranked mutation list, combination strategy, expected affinity improvements.
See WORKFLOW_DETAILS.md Phase 4 for code examples.
Goal: Comprehensive developability scoring (0-100) across five dimensions.
Key steps:
Scoring: Weighted average (aggregation 0.30, PTM 0.25, stability 0.20, expression 0.15, solubility 0.10). Tiers: T1 (>75), T2 (60-75), T3 (<60).
Output: Component scores, overall score, tier classification, mitigation recommendations.
See WORKFLOW_DETAILS.md Phase 5 and CHECKLISTS.md for scoring details.
Goal: Predict immunogenicity risk and design deimmunization strategy.
Key steps:
Output: T-cell epitope list, risk score breakdown, deimmunization strategy, clinical comparison.
See WORKFLOW_DETAILS.md Phase 6 for code examples.
Goal: Assess expression, purification, formulation, and CMC feasibility.
Key steps:
Output: Expression assessment, purification strategy, formulation recommendation, CMC timeline.
See MANUFACTURING.md for detailed manufacturing content and WORKFLOW_DETAILS.md Phase 7 for code.
Goal: Compile all findings into a ranked recommendation with validation plan.
Key outputs:
See REPORT_TEMPLATE.md for the full report template.
IMGT_search_genes: Search germline genes (IGHV, IGKV, etc.)IMGT_get_sequence: Get germline sequencesIMGT_get_gene_info: Database informationSAbDab_search_structures: Search antibody structuresSAbDab_get_structure: Get structure detailsTheraSAbDab_search_therapeutics: Search by nameTheraSAbDab_search_by_target: Search by target antigeniedb_search_epitopes: Search epitopesiedb_search_bcell: B-cell epitopesiedb_search_mhc: MHC-II epitopesiedb_get_epitope_references: Citationsalphafold_get_prediction: Structure predictionUniProt_get_entry_by_accession: Target infoRCSBData_get_entry: Experimental structuresSTRING_get_interaction_partners: Protein interactionsSTRING_get_enrichment: Pathway analysis| File | Contents |
|---|---|
QUICK_START.md | Getting started guide, SOAP tool parameters, Python SDK and MCP usage |
WORKFLOW_DETAILS.md | Code examples for all 8 phases |
REPORT_TEMPLATE.md | Full report template with section formats and example tables |
MANUFACTURING.md | Detailed manufacturing content (expression, purification, formulation, CMC) |
EXAMPLES.md | Complete clinical scenario examples (humanization, affinity, bispecific) |
CHECKLISTS.md | Evidence grading, completeness checklists, scoring details, special considerations |