Structured hypothesis formulation from observations. Use when you have experimental observations or data and need to formulate testable hypotheses with predictions, propose mechanisms, and design experiments to test them. Follows scientific method framework. For open-ended ideation use scientific-brainstorming; for automated LLM-driven hypothesis testing on datasets use hypogenic.
Hypothesis generation is a systematic process for developing testable explanations. Formulate evidence-based hypotheses from observations, design experiments, explore competing explanations, and develop predictions. Apply this skill for scientific inquiry across domains.
This skill should be used when:
Follow this systematic process to generate robust scientific hypotheses:
Start by clarifying the observation, question, or phenomenon that requires explanation:
Search existing scientific literature to ground hypotheses in current evidence. Use both PubMed (for biomedical topics) and general web search (for broader scientific domains):
For biomedical topics:
For all scientific domains:
Search strategy:
references/literature_search_strategies.md for detailed search techniquesAnalyze and integrate findings from literature search:
Develop 3-5 distinct hypotheses that could explain the phenomenon. Each hypothesis should:
Strategies for generating hypotheses:
Assess each hypothesis against established quality criteria from references/hypothesis_quality_criteria.md:
Testability: Can the hypothesis be empirically tested? Falsifiability: What observations would disprove it? Parsimony: Is it the simplest explanation that fits the evidence? Explanatory Power: How much of the phenomenon does it explain? Scope: What range of observations does it cover? Consistency: Does it align with established principles? Novelty: Does it offer new insights beyond existing explanations?
Explicitly note the strengths and weaknesses of each hypothesis.
For each viable hypothesis, propose specific experiments or studies to test it. Consult references/experimental_design_patterns.md for common approaches:
Experimental design elements:
Consider multiple approaches:
For each hypothesis, generate specific, quantitative predictions:
Generate output as a Databricks-compatible Markdown report. Use displayHTML() for colored visual sections or write Markdown cells using %md. Do NOT generate LaTeX or compile PDFs.
Output format:
Write the full report as a Python string passed to displayHTML(), or output clean Markdown that can be pasted into a %md cell.
Report structure:
HTML block template for each hypothesis:
displayHTML("""
<div style="background:#e8f4fd;border-left:5px solid #2196F3;padding:16px;margin:12px 0;border-radius:4px">
<h3 style="color:#1565C0;margin:0 0 8px 0">Hypothesis 1: [Title]</h3>
<p><b>Mechanism:</b> ...</p>
<p><b>Key Evidence:</b></p>
<ul>
<li>...</li>
</ul>
<p><b>Core Assumptions:</b> ...</p>
</div>
""")
Color palette for hypothesis blocks (use in order):
| Hypothesis | Background | Border |
|---|---|---|
| H1 | #e8f4fd | #2196F3 (blue) |
| H2 | #e8f5e9 | #4CAF50 (green) |
| H3 | #f3e5f5 | #9C27B0 (purple) |
| H4 | #e0f2f1 | #009688 (teal) |
| H5 | #fff3e0 | #FF9800 (orange) |
Predictions block:
displayHTML("""
<div style="background:#fffde7;border-left:5px solid #FFC107;padding:16px;margin:12px 0;border-radius:4px">
<h3 style="color:#F57F17;margin:0 0 8px 0">Testable Predictions</h3>
<ul>
<li><b>H1:</b> ...</li>
<li><b>H2:</b> ...</li>
</ul>
</div>
""")
Comparison block:
displayHTML("""
<div style="background:#f5f5f5;border-left:5px solid #607D8B;padding:16px;margin:12px 0;border-radius:4px">
<h3 style="color:#37474F;margin:0 0 8px 0">Critical Comparisons</h3>
<p>...</p>
</div>
""")
Citation format: Use inline author-year style: (Smith et al., 2023). List full references at the end of the notebook in a %md cell.
Citation targets:
Ensure all generated hypotheses meet these standards:
hypothesis_quality_criteria.md — Framework for evaluating hypothesis quality (testability, falsifiability, parsimony, explanatory power, scope, consistency)experimental_design_patterns.md — Common experimental approaches across domains (RCTs, observational studies, lab experiments, computational models)literature_search_strategies.md — Effective search techniques for PubMed and general scientific sources