Comprehensive chemical safety and toxicology assessment integrating ADMET-AI predictions, CTD toxicogenomics, FDA label safety data, DrugBank safety profiles, and STITCH chemical-protein interactions. Performs predictive toxicology (AMES, DILI, LD50, carcinogenicity), organ/system toxicity profiling, chemical-gene-disease relationship mapping, regulatory safety extraction, and environmental hazard assessment. Use when asked about chemical toxicity, drug safety profiling, ADMET properties, environmental health risks, chemical hazard assessment, or toxicogenomic analysis.
Comprehensive chemical safety and toxicology analysis integrating predictive AI models, curated toxicogenomics databases, regulatory safety data, and chemical-biological interaction networks. Generates structured risk assessment reports with evidence grading.
Triggers:
Use Cases:
get_tool_info before calling unfamiliar toolsGrade every toxicity claim by evidence strength:
| Tier | Symbol | Criteria | Examples |
|---|---|---|---|
| T1 | [T1] | Direct human evidence, regulatory finding | FDA boxed warning, clinical trial toxicity, human case reports |
| T2 | [T2] | Animal studies, validated in vitro | Nonclinical toxicology, AMES positive, animal LD50 |
| T3 | [T3] | Computational prediction, association data | ADMET-AI prediction, CTD association, QSAR model |
| T4 | [T4] | Database annotation, text-mined | Literature mention, database entry without validation |
Evidence grades MUST appear in:
Chemical/Drug Query
|
+-- PHASE 0: Compound Disambiguation (ALWAYS FIRST)
| +-- Resolve name -> SMILES, PubChem CID, ChEMBL ID
| +-- Get molecular formula, weight, canonical structure
|
+-- PHASE 1: Predictive Toxicology (ADMET-AI)
| +-- Mutagenicity (AMES)
| +-- Hepatotoxicity (DILI, ClinTox)
| +-- Carcinogenicity
| +-- Acute toxicity (LD50)
| +-- Skin reactions
| +-- Stress response pathways
| +-- Nuclear receptor activity
|
+-- PHASE 2: ADMET Properties
| +-- Absorption: BBB penetrance, bioavailability
| +-- Distribution: clearance, volume of distribution
| +-- Metabolism: CYP interactions (1A2, 2C9, 2C19, 2D6, 3A4)
| +-- Physicochemical: solubility, lipophilicity, pKa
|
+-- PHASE 3: Toxicogenomics (CTD)
| +-- Chemical-gene interactions
| +-- Chemical-disease associations
| +-- Affected biological pathways
|
+-- PHASE 4: Regulatory Safety (FDA Labels)
| +-- Boxed warnings (Black Box)
| +-- Contraindications
| +-- Adverse reactions
| +-- Warnings and precautions
| +-- Nonclinical toxicology
|
+-- PHASE 5: Drug Safety Profile (DrugBank)
| +-- Toxicity data
| +-- Contraindications
| +-- Drug interactions affecting safety
|
+-- PHASE 6: Chemical-Protein Interactions (STITCH)
| +-- Direct chemical-protein binding
| +-- Interaction confidence scores
| +-- Off-target effects
|
+-- PHASE 7: Structural Alerts (ChEMBL)
| +-- Known toxic substructures (PAINS, Brenk)
| +-- Structural alert flags
|
+-- SYNTHESIS: Integrated Risk Assessment
+-- Aggregate all evidence tiers
+-- Risk classification (Low/Medium/High/Critical)
+-- Data gaps and recommendations
CRITICAL: Resolve compound identity before any analysis.
| Input Format | Resolution Strategy |
|---|---|
| Drug name (e.g., "Aspirin") | PubChem_get_CID_by_compound_name -> get SMILES from properties |
| SMILES string | Use directly for ADMET-AI; resolve to CID for other tools |
| PubChem CID | PubChem_get_compound_properties_by_CID -> get SMILES + name |
| ChEMBL ID | ChEMBL_get_molecule -> get SMILES + properties |
PubChem_get_CID_by_compound_name(name=<compound_name>)PubChem_get_compound_properties_by_CID(cid=<cid>)ConnectivitySMILES, CanonicalSMILES, or IsomericSMILES depending on response format)name, smiles, cid, formula, weight, inchi## Compound Identity
| Property | Value |
|----------|-------|
| **Name** | Acetaminophen |
| **PubChem CID** | 1983 |
| **SMILES** | CC(=O)Nc1ccc(O)cc1 |
| **Formula** | C8H9NO2 |
| **Molecular Weight** | 151.16 |
| **InChI** | InChI=1S/C8H9NO2/... |
When: SMILES is available (from Phase 0 or provided directly)
Objective: Run comprehensive AI-predicted toxicity endpoints
All ADMET-AI tools take the same parameter format:
| Tool | Predicted Endpoints | Parameter |
|---|---|---|
ADMETAI_predict_toxicity | AMES, Carcinogens_Lagunin, ClinTox, DILI, LD50_Zhu, Skin_Reaction, hERG | smiles: list[str] |
ADMETAI_predict_stress_response | Stress response pathway activation (ARE, ATAD5, HSE, MMP, p53) | smiles: list[str] |
ADMETAI_predict_nuclear_receptor_activity | AhR, AR, ER, PPARg, Aromatase nuclear receptor activity | smiles: list[str] |
ADMETAI_predict_toxicity(smiles=[resolved_smiles])ADMETAI_predict_stress_response(smiles=[resolved_smiles])ADMETAI_predict_nuclear_receptor_activity(smiles=[resolved_smiles])### Toxicity Predictions [T3]
| Endpoint | Prediction | Interpretation | Concern Level |
|----------|-----------|---------------|---------------|
| AMES Mutagenicity | Inactive | No mutagenic signal | Low |
| Carcinogenicity | Inactive | No carcinogenic signal | Low |
| ClinTox | Active | Clinical toxicity signal | HIGH |
| DILI | Active | Drug-induced liver injury risk | HIGH |
| LD50 (Zhu) | 2.45 log(mg/kg) | ~282 mg/kg (moderate) | Medium |
| Skin Reaction | Inactive | No skin sensitization signal | Low |
| hERG Inhibition | Active | Cardiac arrhythmia risk | HIGH |
*All predictions from ADMET-AI. Evidence tier: [T3] (computational prediction)*
When: SMILES is available
Objective: Full ADMET characterization beyond toxicity
| Tool | Properties Predicted | Parameter |
|---|---|---|
ADMETAI_predict_BBB_penetrance | Blood-brain barrier crossing probability | smiles: list[str] |
ADMETAI_predict_bioavailability | Oral bioavailability (F20%, F30%) | smiles: list[str] |
ADMETAI_predict_clearance_distribution | Clearance, VDss, half-life, PPB | smiles: list[str] |
ADMETAI_predict_CYP_interactions | CYP1A2, 2C9, 2C19, 2D6, 3A4 inhibition/substrate | smiles: list[str] |
ADMETAI_predict_physicochemical_properties | LogP, LogD, LogS, MW, pKa | smiles: list[str] |
ADMETAI_predict_solubility_lipophilicity_hydration | Aqueous solubility, lipophilicity, hydration free energy | smiles: list[str] |
### ADMET Profile [T3]
#### Absorption
| Property | Value | Interpretation |
|----------|-------|----------------|
| BBB Penetrance | Yes | Crosses blood-brain barrier |
| Bioavailability (F20%) | 85% | Good oral absorption |
#### Distribution
| Property | Value | Interpretation |
|----------|-------|----------------|
| VDss | 1.2 L/kg | Moderate tissue distribution |
| PPB | 92% | Highly protein bound |
#### Metabolism
| CYP Enzyme | Substrate | Inhibitor |
|------------|-----------|-----------|
| CYP1A2 | No | No |
| CYP2C9 | Yes | No |
| CYP2C19 | No | No |
| CYP2D6 | No | No |
| CYP3A4 | Yes | Yes (DDI risk) |
#### Excretion
| Property | Value | Interpretation |
|----------|-------|----------------|
| Clearance | 8.5 mL/min/kg | Moderate clearance |
| Half-life | 6.2 h | Moderate half-life |
When: Compound name is resolved
Objective: Map chemical-gene-disease relationships from curated CTD data
| Tool | Function | Parameter |
|---|---|---|
CTD_get_chemical_gene_interactions | Genes affected by chemical | input_terms: str (chemical name) |
CTD_get_chemical_diseases | Diseases linked to chemical exposure | input_terms: str (chemical name) |
CTD_get_chemical_gene_interactions(input_terms=compound_name)CTD_get_chemical_diseases(input_terms=compound_name)### Toxicogenomics (CTD) [T2/T3]
#### Chemical-Gene Interactions (Top 20)
| Gene | Interaction | Type | Evidence |
|------|------------|------|----------|
| CYP1A2 | increases expression | mRNA | [T2] curated |
| TP53 | affects activity | protein | [T2] curated |
| ... | ... | ... | ... |
**Total interactions found**: 156
**Top affected pathways**: Xenobiotic metabolism, Apoptosis, DNA damage response
#### Chemical-Disease Associations (Top 10)
| Disease | Association Type | Evidence |
|---------|-----------------|----------|
| Liver Neoplasms | marker/mechanism | [T2] curated |
| Contact Dermatitis | therapeutic | [T2] curated |
| ... | ... | ... |
When: Compound has an approved drug name
Objective: Extract regulatory safety information from FDA drug labels
| Tool | Information Retrieved | Parameter |
|---|---|---|
FDA_get_boxed_warning_info_by_drug_name | Black box warnings (most serious) | drug_name: str |
FDA_get_contraindications_by_drug_name | Absolute contraindications | drug_name: str |
FDA_get_adverse_reactions_by_drug_name | Known adverse reactions | drug_name: str |
FDA_get_warnings_by_drug_name | Warnings and precautions | drug_name: str |
FDA_get_nonclinical_toxicology_info_by_drug_name | Animal toxicology data | drug_name: str |
FDA_get_carcinogenic_mutagenic_fertility_by_drug_name | Carcinogenicity/mutagenicity/fertility data | drug_name: str |
### Regulatory Safety (FDA) [T1]
#### Boxed Warning
**PRESENT** - Hepatotoxicity risk with doses >4g/day. Liver failure reported. [T1]
#### Contraindications
- Severe hepatic impairment [T1]
- Known hypersensitivity [T1]
#### Adverse Reactions (by frequency)
| Reaction | Frequency | Severity |
|----------|-----------|----------|
| Nausea | Common (>1%) | Mild |
| Hepatotoxicity | Rare (<0.1%) | Severe |
| ... | ... | ... |
#### Nonclinical Toxicology [T2]
- **Carcinogenicity**: No carcinogenic potential in 2-year rat/mouse studies
- **Mutagenicity**: Negative in Ames assay and in vivo micronucleus test
- **Fertility**: No effects on fertility at doses up to 10x human dose
When: Compound is a known drug
Objective: Retrieve curated drug safety data from DrugBank
| Tool | Information | Parameters |
|---|---|---|
drugbank_get_safety_by_drug_name_or_drugbank_id | Toxicity, contraindications | query: str, case_sensitive: bool, exact_match: bool, limit: int |
drugbank_get_safety_by_drug_name_or_drugbank_id(query=drug_name, case_sensitive=False, exact_match=False, limit=5)When: Compound can be identified by name or SMILES
Objective: Map chemical-protein interaction network for off-target assessment
| Tool | Function | Parameters |
|---|---|---|
STITCH_resolve_identifier | Resolve chemical name to STITCH ID | identifier: str, species: int (9606=human) |
STITCH_get_chemical_protein_interactions | Get chemical-protein interactions | identifiers: list[str], species: int, required_score: int |
STITCH_get_interaction_partners | Get interaction network | identifiers: list[str], species: int, limit: int |
STITCH_resolve_identifier(identifier=compound_name, species=9606)STITCH_get_chemical_protein_interactions(identifiers=[stitch_id], species=9606, required_score=700)When: ChEMBL molecule ID is available (from Phase 0)
Objective: Check for known toxic substructures
| Tool | Function | Parameters |
|---|---|---|
ChEMBL_search_compound_structural_alerts | Find structural alert matches | molecule_chembl_id: str, limit: int |
ChEMBL_search_compound_structural_alerts(molecule_chembl_id=chembl_id, limit=20)Always the final section. Integrates all evidence into actionable risk classification.
| Risk Level | Criteria |
|---|---|
| CRITICAL | FDA boxed warning present OR multiple [T1] toxicity findings OR active DILI + active hERG |
| HIGH | FDA warnings present OR [T2] animal toxicity OR multiple active ADMET endpoints |
| MEDIUM | Some [T3] predictions positive OR CTD disease associations OR structural alerts |
| LOW | All ADMET endpoints negative AND no FDA/DrugBank safety flags AND no CTD concerns |
| INSUFFICIENT DATA | Fewer than 3 phases returned data; cannot make confident assessment |
## Integrated Risk Assessment
### Overall Risk Classification: [HIGH]
### Evidence Summary
| Dimension | Finding | Evidence Tier | Concern |
|-----------|---------|--------------|---------|
| ADMET Toxicity | DILI active, hERG active | [T3] | HIGH |
| FDA Label | Boxed warning for hepatotoxicity | [T1] | CRITICAL |
| CTD Toxicogenomics | 156 gene interactions, liver neoplasms | [T2] | HIGH |
| DrugBank | Known hepatotoxicity at high doses | [T2] | HIGH |
| STITCH | Binds CYP3A4, hERG | [T3] | MEDIUM |
| Structural Alerts | 2 Brenk alerts | [T3] | MEDIUM |
### Key Safety Concerns
1. **Hepatotoxicity** [T1]: FDA boxed warning + ADMET-AI DILI prediction + CTD liver disease associations
2. **Cardiac Risk** [T3]: ADMET-AI hERG prediction + STITCH hERG interaction
3. **Drug Interactions** [T3]: CYP3A4 substrate/inhibitor, potential DDI risk
### Data Gaps
- [ ] No in vivo genotoxicity data available
- [ ] STITCH interaction scores moderate (700-900)
- [ ] No environmental exposure data
### Recommendations
1. Avoid doses >4g/day (hepatotoxicity threshold) [T1]
2. Monitor liver function in chronic use [T1]
3. Screen for CYP3A4 interactions before co-administration [T3]
4. Consider cardiac monitoring for at-risk patients [T3]
Before finalizing any report, verify:
Critical Parameter Notes (verified from source code):
| Tool | Parameter Name | Type | Notes |
|---|---|---|---|
| All ADMETAI tools | smiles | list[str] | Always a list, even for single compound |
| All CTD tools | input_terms | str | Chemical name, MeSH name, CAS RN, or MeSH ID |
| All FDA tools | drug_name | str | Brand or generic drug name |
| drugbank_get_safety_* | query, case_sensitive, exact_match, limit | str, bool, bool, int | All 4 required |
| STITCH_resolve_identifier | identifier, species | str, int | species=9606 for human |
| STITCH_get_chemical_protein_interactions | identifiers, species, required_score | list[str], int, int | required_score=400 default |
| PubChem_get_CID_by_compound_name | name | str | Compound name (not SMILES) |
| PubChem_get_compound_properties_by_CID | cid | int | Numeric CID |
| ChEMBL_search_compound_structural_alerts | molecule_chembl_id | str | ChEMBL ID (e.g., "CHEMBL112") |
{status: "success", data: {...}} with prediction values{status, data} with label text{data: [...]} with drug records{IdentifierList: {CID: [...]}} (may or may not have data wrapper)CID, MolecularWeight, ConnectivitySMILES, IUPACNameInput: SMILES string for new molecule
Workflow: Phase 0 (SMILES->CID) -> Phase 1 (toxicity) -> Phase 2 (ADMET) -> Phase 7 (structural alerts) -> Synthesis
Output: Predictive safety profile for novel compound
Input: Drug name (e.g., "Acetaminophen")
Workflow: All phases (0-7 + Synthesis)
Output: Complete safety dossier with regulatory + predictive + database evidence
Input: Chemical name (e.g., "Bisphenol A")
Workflow: Phase 0 -> Phase 1 -> Phase 2 -> Phase 3 (CTD, key for env chemicals) -> Phase 6 -> Synthesis
Output: Environmental health risk assessment focused on gene-disease associations
Input: Multiple SMILES strings
Workflow: Phase 0 -> Phase 1 (batch) -> Phase 2 (batch) -> Comparative table -> Synthesis
Output: Comparative toxicity table ranking compounds by safety
Input: Chemical name + specific gene or disease interest
Workflow: Phase 0 -> Phase 3 (CTD expanded) -> Literature search -> Synthesis
Output: Detailed chemical-gene-disease mechanistic analysis
All analyses generate a structured markdown report with progressive sections:
# Chemical Safety & Toxicology Report: [Compound Name]
**Generated**: YYYY-MM-DD HH:MM
**Compound**: [Name] | SMILES: [SMILES] | CID: [CID]
## Executive Summary
[2-3 sentence overview with risk classification and key findings, all graded]
## 1. Compound Identity
[Phase 0 results - disambiguation table]
## 2. Predictive Toxicology
[Phase 1 results - ADMET-AI toxicity endpoints]
## 3. ADMET Profile
[Phase 2 results - absorption, distribution, metabolism, excretion]
## 4. Toxicogenomics
[Phase 3 results - CTD chemical-gene-disease relationships]
## 5. Regulatory Safety
[Phase 4 results - FDA label information]
## 6. Drug Safety Profile
[Phase 5 results - DrugBank data]
## 7. Chemical-Protein Interactions
[Phase 6 results - STITCH network]
## 8. Structural Alerts
[Phase 7 results - ChEMBL alerts]
## 9. Integrated Risk Assessment
[Synthesis - risk classification, evidence summary, data gaps, recommendations]
## Appendix: Methods and Data Sources
[Tool versions, databases queried, date of access]
Chemical Safety & Toxicology Assessment Skill provides comprehensive safety evaluation by integrating:
Outputs: Structured markdown report with risk classification, evidence grading, and actionable recommendations
Best for: Drug safety assessment, chemical hazard profiling, environmental toxicology, ADMET characterization, toxicogenomic analysis
Total tools integrated: 25+ tools across 6 databases