技能档案

Lifesciences Crispr

Name: Lifesciences Crispr
Author: donbr

Validates synthetic lethality claims from CRISPR knockout screens using BioGRID ORCS 5-phase workflow. This skill should be used when the user asks to "validate synthetic lethality", "query CRISPR essentiality data", "find gene dependencies", "compare cell line screens", or mentions BioGRID ORCS, gene knockout data, essentiality scores (CERES, MAGeCK, BAGEL), or asks to validate claims from published CRISPR papers.

donbr0 星标2026年1月30日

职业
分类: 生物信息学

技能内容

CRISPR Essentiality & Synthetic Lethality Validation

Validate synthetic lethality hypotheses using BioGRID ORCS CRISPR screen data via curl.

Quick Reference

Task	Endpoint	Key Parameters
Get essential screens only	`/gene/{entrez_id}?hit=yes`	`hit=yes` filters to essential
Get all gene screens	`/gene/{entrez_id}`	Returns all screens
Get screen annotations	`/screens/?screenID=1\|2\|3`	Pipe-separated IDs
Find cell line screens	`/screens/?cellLine={name}`	Cell line name

BioGRID ORCS API

IMPORTANT: Use orcsws.thebiogrid.org (NOT )

相关技能

Lifesciences Crispr | Skills Pool

orcs.thebiogrid.org

{
  "SCREEN_ID": "16",
  "IDENTIFIER_ID": "7298",
  "OFFICIAL_SYMBOL": "TYMS",
  "SCORE.1": "100.84",
  "SCORE.2": "-",
  "HIT": "YES",
  "SOURCE": "BioGRID ORCS"
}

# Using HGNC MCP: search_genes → get_gene
# Extract "entrez" from cross_references object
# Example: DHODH → {"cross_references": {"entrez": "1723"}}

# Using Entrez MCP: search_genes
# Top result format: "NCBIGene:1723"
# Extract numeric ID: 1723

# Get ONLY screens where gene is essential (hit=yes)
curl -s "https://orcsws.thebiogrid.org/gene/7298?accesskey=${BIOGRID_API_KEY}&hit=yes&format=json" > gene_essential.json

# Count essential screens
cat gene_essential.json | jq '. | length'
# Output: 352 (vs 1,446 without hit=yes filter)

# Step 3a: Extract unique screen IDs from essential screens
SCREEN_IDS=$(curl -s "https://orcsws.thebiogrid.org/gene/7298?accesskey=${BIOGRID_API_KEY}&hit=yes&format=json" | \
  jq -r '.[].SCREEN_ID' | sort -u | head -20 | tr '\n' '|' | sed 's/|$//')

# Step 3b: Get screen annotations (cell line, PubMed, author)
curl -s "https://orcsws.thebiogrid.org/screens/?accesskey=${BIOGRID_API_KEY}&screenID=${SCREEN_IDS}&format=json" | \
  jq '.[] | {SCREEN_ID, SOURCE_ID, AUTHOR, CELL_LINE, PHENOTYPE}'

{
  "SCREEN_ID": "16",
  "SOURCE_ID": "26627737",
  "AUTHOR": "Hart T (2015)",
  "CELL_LINE": "HCT 116",
  "PHENOTYPE": "cell proliferation"
}

# Get essential screens with scores
curl -s "https://orcsws.thebiogrid.org/gene/7298?accesskey=${BIOGRID_API_KEY}&hit=yes&format=json" | \
  jq '.[] | select(.SCREEN_ID == "16" or .SCREEN_ID == "17") | {SCREEN_ID, SCORE: .["SCORE.1"], symbol: .OFFICIAL_SYMBOL}'

{"SCREEN_ID": "16", "SCORE": "100.84", "symbol": "TYMS"}
{"SCREEN_ID": "17", "SCORE": "107.563", "symbol": "TYMS"}

# Count essential vs total screens
TOTAL=$(curl -s "https://orcsws.thebiogrid.org/gene/7298?accesskey=${BIOGRID_API_KEY}&format=json" | jq '. | length')
ESSENTIAL=$(curl -s "https://orcsws.thebiogrid.org/gene/7298?accesskey=${BIOGRID_API_KEY}&hit=yes&format=json" | jq '. | length')
echo "TYMS essential in $ESSENTIAL / $TOTAL screens ($(echo "scale=1; $ESSENTIAL * 100 / $TOTAL" | bc)%)"
# Output: TYMS essential in 352 / 1446 screens (24.3%)

# Get cell lines where gene is essential
curl -s "https://orcsws.thebiogrid.org/gene/7298?accesskey=${BIOGRID_API_KEY}&hit=yes&format=json" | \
  jq -r '.[].SCREEN_ID' | sort -u | head -10 | tr '\n' '|' | sed 's/|$//' > screen_ids.txt

curl -s "https://orcsws.thebiogrid.org/screens/?accesskey=${BIOGRID_API_KEY}&screenID=$(cat screen_ids.txt)&format=json" | \
  jq '.[] | .CELL_LINE' | sort | uniq -c | sort -rn | head -10

Method	Description	Interpretation
CERES	Computational correction for copy number effects	Most common, negative = essential
Kolmogorov-Smirnov	Statistical enrichment test	Log p-value, higher = more significant
MAGeCK	Model-based Analysis of Genome-wide CRISPR	Negative = depletion = essential
BAGEL	Bayesian Analysis of Gene EssentiaLity	Bayes Factor, positive = essential

import requests

BIOGRID_API_KEY = os.getenv("BIOGRID_API_KEY")
BASE_URL = "https://orcsws.thebiogrid.org"

def get_essential_screens(entrez_id: int) -> list[dict]:
    """Get screens where gene is essential (hit=yes)."""
    response = requests.get(
        f"{BASE_URL}/gene/{entrez_id}",
        params={
            "accesskey": BIOGRID_API_KEY,
            "hit": "yes",
            "format": "json"
        }
    )
    return response.json()

# Example: Get TYMS (Entrez 7298) essential screens
essential_screens = get_essential_screens(7298)
print(f"TYMS essential in {len(essential_screens)} screens")
# Output: TYMS essential in 352 screens

def get_screen_annotations(screen_ids: list[str]) -> list[dict]:
    """Get cell line and publication data for screens."""
    response = requests.get(
        f"{BASE_URL}/screens/",
        params={
            "accesskey": BIOGRID_API_KEY,
            "screenID": "|".join(screen_ids),
            "format": "json"
        }
    )
    return response.json()

# Extract unique screen IDs from essential screens
screen_ids = list(set(s["SCREEN_ID"] for s in essential_screens))

# Get annotations (batched)
annotations = get_screen_annotations(screen_ids[:20])  # First 20

for a in annotations[:5]:
    print(f"Screen {a['SCREEN_ID']}: {a['CELL_LINE']} - PMID:{a['SOURCE_ID']} ({a['AUTHOR']})")

def calculate_essentiality_rate(entrez_id: int) -> tuple[int, int, float]:
    """Calculate what % of screens show gene as essential."""
    # Get all screens
    all_screens = requests.get(
        f"{BASE_URL}/gene/{entrez_id}",
        params={"accesskey": BIOGRID_API_KEY, "format": "json"}
    ).json()

    # Get essential screens only
    essential = requests.get(
        f"{BASE_URL}/gene/{entrez_id}",
        params={"accesskey": BIOGRID_API_KEY, "hit": "yes", "format": "json"}
    ).json()

    total = len(all_screens)
    essential_count = len(essential)
    rate = essential_count / total * 100 if total > 0 else 0

    return essential_count, total, rate

# Example
essential, total, rate = calculate_essentiality_rate(7298)
print(f"TYMS: {essential}/{total} screens ({rate:.1f}%) show essentiality")
# Output: TYMS: 352/1446 screens (24.3%) show essentiality

# Phase 1: Use MCP to get Entrez ID
hgnc_result = await client.call_tool("hgnc_search_genes", {"query": "DHODH"})
gene = await client.call_tool("hgnc_get_gene", {"hgnc_id": hgnc_result["items"][0]["id"]})
entrez_id = gene["cross_references"]["entrez"]  # "1723"

# Phase 2-5: Use curl with ORCS (see workflow above)

Lifesciences Crispr

CRISPR Essentiality & Synthetic Lethality Validation

Quick Reference

BioGRID ORCS API

Lifesciences Crispr

CRISPR Essentiality & Synthetic Lethality Validation

Quick Reference

BioGRID ORCS API

Critical Parameter: `hit=yes`

Data Format (JSON)

Screen Annotation Fields

5-Phase Synthetic Lethality Validation Workflow

Phase 1: Resolve Gene Identifiers

Phase 2: Query ORCS for Essential Screens Only

Phase 3: Get Screen Annotations (Cell Lines & PubMed)

Phase 4: Extract Dependency Scores

Phase 5: Compare Across Genetic Backgrounds

Scoring Methods

Common Pitfalls

Complete Example

Python Code Patterns

Query Gene Essentiality

Get Screen Annotations with PubMed

Calculate Essentiality Rate

Integration with Life Sciences MCPs

References

Brenda Database

Clinical Decision Support Documents

Healthcare Cdss Patterns

Nanoclaw Repl

Deep Research

Data Analyst

Lifesciences Crispr

CRISPR Essentiality & Synthetic Lethality Validation

Quick Reference

BioGRID ORCS API

Lifesciences Crispr

CRISPR Essentiality & Synthetic Lethality Validation

Quick Reference

BioGRID ORCS API

Critical Parameter: hit=yes

Data Format (JSON)

Screen Annotation Fields

5-Phase Synthetic Lethality Validation Workflow

Phase 1: Resolve Gene Identifiers

Phase 2: Query ORCS for Essential Screens Only

Phase 3: Get Screen Annotations (Cell Lines & PubMed)

Phase 4: Extract Dependency Scores

Phase 5: Compare Across Genetic Backgrounds

Scoring Methods

Common Pitfalls

Complete Example

Python Code Patterns

Query Gene Essentiality

Get Screen Annotations with PubMed

Calculate Essentiality Rate

Integration with Life Sciences MCPs

References

Brenda Database

Clinical Decision Support Documents

Healthcare Cdss Patterns

Nanoclaw Repl

Deep Research

Data Analyst

Critical Parameter: `hit=yes`