Name: QTL Candidate Gene Analysis
Author: warelab

QTL Candidate Gene Analysis

Comprehensive QTL candidate gene analysis using the Gramene/Sorghumbase MCP server. Expression-driven pipeline: identifies genes in a QTL region, profiles their expression and their orthologs' expression across tissues, connects expression patterns to the trait of interest, then supplements with functional annotations, literature, LOF alleles, and copy number variation. Produces a ranked candidate list and an interactive HTML report. Use this skill whenever the user asks about: QTL analysis, candidate gene identification, genes in a genomic interval, prioritizing genes under a QTL peak, trait-associated region analysis, or any request that combines positional gene data with functional annotation for gene ranking. Also trigger when the user mentions a chromosomal region and a trait together, even casually (e.g., "what's interesting on chr3 55-58Mb for yield in sorghum").

warelab · 0 stars · March 27, 2026

Category: Bioinformatics

You are running a multi-step candidate gene analysis for a QTL (quantitative trait locus) region. The goal is to identify, annotate, and rank genes under a QTL peak to help a plant biologist prioritize candidates for experimental validation.

The analysis uses the Gramene MCP server tools. Read the AGENT_PROMPT.md system prompt if you haven't already — it documents every tool, field, and workflow pattern you'll need.

Guiding Principle: Expression First

Gene expression is the most informative single signal for QTL candidate prioritization. A gene that is highly and specifically expressed in the trait-relevant tissue — and whose ortholog in a well-studied species shows the same pattern — is far more likely to be causal than a gene identified only by position and annotation keywords.

The pipeline below is structured so that expression profiling happens early and drives the ranking. The supplementary analyses (VEP, CNV, enrichment, literature) refine and contextualize the expression-based ranking but don't replace it.

Input Resolution

The user will provide one of:

A. Direct coordinates: chromosome, start, end, and species. Parse these and proceed to the gene scan.
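
As an illustration of parsing direct coordinates, here is a minimal Python sketch that turns a casual region string such as "chr3 55-58Mb" into integer base-pair coordinates. The `Region` type, the `parse_region` helper, and the accepted spellings (`chr` prefix, `kb`/`Mb` units, `-`, `..`, or `to` as range separators) are assumptions made for this example, not part of the Gramene MCP server. Species still has to be resolved separately.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """Hypothetical container for a parsed interval (coordinates in bp)."""
    chrom: str
    start: int
    end: int


# Multipliers for the unit suffixes this sketch understands (assumed set).
_UNITS = {"bp": 1, "kb": 1_000, "mb": 1_000_000}

_NUMBER = r"\d[\d,]*(?:\.\d+)?"
_PATTERN = re.compile(
    r"(?:chr(?:omosome)?\s*)?(?P<chrom>[0-9A-Za-z]+)"   # optional chr prefix
    r"\s*[:\s]\s*"                                      # colon or space
    rf"(?P<start>{_NUMBER})\s*(?P<u1>bp|kb|mb)?"
    r"\s*(?:-|\.\.|to)\s*"                              # range separator
    rf"(?P<end>{_NUMBER})\s*(?P<u2>bp|kb|mb)?",
    re.IGNORECASE,
)


def parse_region(text: str) -> Region:
    """Parse strings like 'chr3 55-58Mb' or '3:55,000,000-58,000,000'."""
    m = _PATTERN.search(text)
    if m is None:
        raise ValueError(f"no genomic interval found in {text!r}")
    # A unit given on only one side applies to both ('55-58Mb' means Mb-Mb).
    unit_start = (m["u1"] or m["u2"] or "bp").lower()
    unit_end = (m["u2"] or m["u1"] or "bp").lower()
    start = round(float(m["start"].replace(",", "")) * _UNITS[unit_start])
    end = round(float(m["end"].replace(",", "")) * _UNITS[unit_end])
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Region(m["chrom"], start, end)


region = parse_region("chr3 55-58Mb")
print(region)
```

The sketch deliberately rejects intervals whose start exceeds the end rather than silently swapping them, so that a mistyped region is surfaced to the user before the gene scan runs.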

Trait category	PO terms (integer IDs)	Tissues
Grain yield / seed	9001, 9089, 9010	fruit/grain, endosperm, seed
Plant height / growth	20142, 25029, 9047	stem internode, shoot, stem
Flowering time	9051, 9049, 6310	spikelet, inflorescence, flower
Root traits	9005, 25025, 20127	root, root system, primary root
Drought / abiotic stress	25034, 9005	leaf, root
Grain quality	9001, 9089	fruit/grain, endosperm
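
The trait-to-tissue table above can be held as a simple lookup. The following Python sketch copies the integer PO IDs from the table and formats them as standard zero-padded Plant Ontology identifiers (for example `PO:0009001`). The names `TRAIT_PO_TERMS`, `po_curie`, and `po_terms_for_trait`, and the loose matching on either side of the "/" in a category name, are assumptions for illustration. Whether a given MCP tool expects the integer or the `PO:` form should be checked against AGENT_PROMPT.md.

```python
# Integer PO IDs per trait category, copied from the table above.
TRAIT_PO_TERMS = {
    "grain yield / seed": [9001, 9089, 9010],
    "plant height / growth": [20142, 25029, 9047],
    "flowering time": [9051, 9049, 6310],
    "root traits": [9005, 25025, 20127],
    "drought / abiotic stress": [25034, 9005],
    "grain quality": [9001, 9089],
}


def po_curie(po_int: int) -> str:
    """Format an integer PO ID as a 7-digit Plant Ontology identifier."""
    return f"PO:{po_int:07d}"


def po_terms_for_trait(trait: str) -> list[int]:
    """Return PO IDs for a trait, matching the full category or one half of it.

    The matching rule is an assumption of this sketch: 'seed' matches
    'grain yield / seed', 'drought' matches 'drought / abiotic stress'.
    """
    key = trait.strip().lower()
    for category, terms in TRAIT_PO_TERMS.items():
        parts = [part.strip() for part in category.split("/")]
        if key == category or key in parts:
            return terms
    raise KeyError(f"no PO terms registered for trait {trait!r}")


terms = po_terms_for_trait("Flowering time")
print([po_curie(t) for t in terms])
```

Traits that do not match any category raise `KeyError` here, which is a cue to ask the user which tissue is relevant rather than guess.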

Criterion	Weight	Points	How to score
Expression in trait-relevant tissue	HIGH	0–4	4 = top 10% in trait tissue + tissue-specific; 3 = top quartile; 2 = expressed above median; 1 = detectable; 0 = not expressed or no data
Conserved ortholog expression	HIGH	0–3	3 = rice/maize ortholog shows same tissue-specific pattern; 2 = ortholog expressed in relevant tissue but not specific; 1 = ortholog expressed elsewhere; 0 = no ortholog data
Differential expression	HIGH	0–3	3 = significantly DE in target species under trait condition; 2 = DE in ortholog; 1 = marginally significant; 0 = not DE
Published functional evidence	MEDIUM	0–3	3 = gene/ortholog directly studied for this trait; 2 = ortholog functionally characterized for related trait; 1 = mentioned in literature; 0 = no pubs
Trait ontology / GO / pathway	MEDIUM	0–3	3 = direct TO match + relevant GO/pathway; 2 = relevant GO or pathway only; 1 = tangentially related; 0 = no match
Gene description relevance	LOW	0–2	2 = description directly implies trait-related function; 1 = plausibly related; 0 = uncharacterized or unrelated
LOF germplasm available	LOW	0–1	1 = EMS or natural LOF alleles exist; 0 = none
CNV/PAV variation	LOW	0–1	1 = copy number variable across genomes; 0 = conserved single copy
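
The scoring rubric above can be sketched in Python as follows. The point caps are taken directly from the table (they sum to 20). Everything else is an assumption for illustration: the criterion key names, the use of a plain sum as the total, the alphabetical tie-break, and the reading of "top 10%" and "top quartile" as percentile ranks of at least 90 and 75. The gene IDs are hypothetical.

```python
from typing import Optional

# Maximum points per criterion, from the rubric table (total = 20).
MAX_POINTS = {
    "expression_trait_tissue": 4,
    "ortholog_expression": 3,
    "differential_expression": 3,
    "published_evidence": 3,
    "ontology_pathway": 3,
    "description_relevance": 2,
    "lof_germplasm": 1,
    "cnv_pav": 1,
}


def expression_points(percentile: Optional[float],
                      tissue_specific: bool = False,
                      detectable: bool = True) -> int:
    """Map expression rank in the trait tissue to 0-4 points.

    Assumes `percentile` is the gene's rank (0-100) among genes in that tissue.
    """
    if percentile is None or not detectable:
        return 0  # not expressed or no data
    if percentile >= 90 and tissue_specific:
        return 4  # top 10% and tissue-specific
    if percentile >= 75:
        return 3  # top quartile
    if percentile > 50:
        return 2  # above median
    return 1      # detectable


def total_score(scores: dict[str, int]) -> int:
    """Sum the per-criterion points; missing criteria count as 0."""
    unknown = set(scores) - set(MAX_POINTS)
    if unknown:
        raise ValueError(f"unknown criteria: {sorted(unknown)}")
    total = 0
    for criterion, cap in MAX_POINTS.items():
        value = scores.get(criterion, 0)
        if not 0 <= value <= cap:
            raise ValueError(f"{criterion} must be in 0..{cap}, got {value}")
        total += value
    return total


def rank_candidates(candidates: dict[str, dict[str, int]]) -> list[tuple[str, int]]:
    """Return (gene, total) pairs, highest total first, ties by gene ID."""
    scored = [(gene, total_score(s)) for gene, s in candidates.items()]
    return sorted(scored, key=lambda item: (-item[1], item[0]))


# Hypothetical gene IDs and scores, for illustration only.
example = {
    "geneA": {"expression_trait_tissue": expression_points(95, True),
              "ortholog_expression": 3},
    "geneB": {"published_evidence": 3, "lof_germplasm": 1},
}
print(rank_candidates(example))
```

Because the three HIGH-weight criteria are all expression-based and carry 10 of the 20 points, a plain sum already lets expression dominate the ranking, which is consistent with the expression-first principle.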

Pipeline Steps

Step 1: Gene Scan and Ortholog Resolution

Step 2: Expression Profiling (Primary Analysis)

Step 3: Functional Annotation and Literature

Step 4: Supplementary Analyses

Candidate Ranking

Report Structure

Required Sections (in this order)

HTML Implementation Notes

Important Considerations
