Name: Query Understanding Workflow
Author: conradry

Skills suchen.../

Query Understanding Workflow | Skills Pool

Type	Description	Example
`mechanism`	How does a perturbation work?	"How does KRAS knockout affect downstream signaling?"
`comparison`	Compare perturbations or conditions	"Compare dexamethasone vs prednisolone in A549 cells"
`dose-response`	Dose or time-dependent effects	"What happens to TP53 targets at different nutlin-3a doses?"
`screening`	Large-scale perturbation screen results	"What are the top hits from a genome-wide CRISPR screen in K562?"
`dataset-search`	Find relevant datasets	"Find CRISPR screens in lung cancer cell lines"
`analysis`	Analyze a specific dataset	"Run differential expression on this perturbation dataset"

Type	Indicators
`chemical`	Drug names, compound IDs, dose mentions, MOA references
`genetic_crispr`	CRISPR, Cas9, sgRNA, guide RNA, knockout, KO
`genetic_rnai`	RNAi, shRNA, siRNA, knockdown, KD
`combinatorial`	Multiple perturbations, combinations, synergy, interaction
`unknown`	Insufficient information to classify

{
  "raw_query": "<original user question>",
  "entities": {
    "genes": ["<gene symbols>"],
    "drugs": ["<drug/compound names>"],
    "cell_types": ["<cell lines or types>"],
    "diseases": ["<disease names>"],
    "organisms": ["<species, default 'Homo sapiens'>"],
    "perturbation_agents": ["<specific constructs if mentioned>"]
  },
  "question_type": "<mechanism|comparison|dose-response|screening|dataset-search|analysis>",
  "perturbation_type": "<chemical|genetic_crispr|genetic_rnai|combinatorial|unknown>",
  "search_terms": ["<derived search keywords for paper/dataset retrieval>"],
  "filters": {
    "organism": "<species filter>",
    "data_availability": "<true if user wants downloadable data>",
    "year_range": [null, null]
  },
  "confidence": {
    "entity_extraction": "<0.0-1.0>",
    "question_classification": "<0.0-1.0>",
    "perturbation_classification": "<0.0-1.0>"
  }
}

{
  "raw_query": "What are the effects of KRAS knockout in A549 cells?",
  "entities": {
    "genes": ["KRAS"],
    "drugs": [],
    "cell_types": ["A549"],
    "diseases": [],
    "organisms": ["Homo sapiens"],
    "perturbation_agents": []
  },
  "question_type": "mechanism",
  "perturbation_type": "genetic_crispr",
  "search_terms": ["KRAS", "knockout", "A549", "CRISPR"],
  "filters": {
    "organism": "Homo sapiens",
    "data_availability": true,
    "year_range": [null, null]
  },
  "confidence": {
    "entity_extraction": 0.95,
    "question_classification": 0.85,
    "perturbation_classification": 0.90
  }
}

Query Understanding Workflow

Purpose

When to Use

Workflow Steps

Step 1: Entity Extraction

Query Understanding Workflow

Purpose

When to Use

Workflow Steps

Step 1: Entity Extraction

Step 2: Question Type Classification

Step 3: Perturbation Type Detection

Step 4: Build Structured Query Object

Step 5: Ambiguity Resolution

Output

Examples

Step 6: Resolve Identifiers Against DB (Optional)

Dependencies

Deep Research

Data Analyst

Academic Researcher

Data Scientist

Biopython

Binary Analysis Patterns