Bio/medical/scientific evidence hierarchy and anti-hallucination rules. Use when conducting claim-heavy medical research, genomics interpretation, supplement evaluation, pharmacogenomics, or clinical evidence synthesis. NOT for casual health questions, software engineering, or physics. Companion to researcher skill.
Domain-specific guardrails for scientific research. Use alongside researcher for the workflow; this skill provides the evidence hierarchy, anti-hallucination rules, and bio-specific failure modes.
Citation requirement: Every non-trivial factual claim needs a resolvable citation (DOI, PMID, ClinicalTrials.gov ID, or official URL). If you can't cite it, label "UNCITED."
No fake citations: Never invent paper titles, authors, journals, or numbers. If you can't find the paper, say so.
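As a format-level sanity check, the identifier shapes above (DOI, PMID, ClinicalTrials.gov NCT ID) can be validated mechanically. A sketch, with illustrative patterns and function names — a well-formed identifier can still point at a paper that doesn't exist, so this never replaces actually resolving it:

```python
import re

# Format-level check only: validates the shape of an identifier,
# not whether it resolves to a real paper or trial.
PATTERNS = {
    "DOI": re.compile(r"10\.\d{4,9}/\S+"),
    "PMID": re.compile(r"\d{1,8}"),
    "NCT": re.compile(r"NCT\d{8}"),
}

def citation_label(kind: str, value: str) -> str:
    """Return 'KIND:value' if the format is plausible, else 'UNCITED'."""
    pattern = PATTERNS.get(kind)
    if pattern and pattern.fullmatch(value):
        return f"{kind}:{value}"
    return "UNCITED"
```

Anything that fails the check (or can't be resolved downstream) gets the "UNCITED" label rather than a guessed identifier.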
Separate evidence layers: Keep layers (a)–(g) strictly distinct. NEVER let the mechanistic layers (a–c) substitute for the clinical layers (d–g). Say explicitly: "Mechanistic evidence only; no human clinical trial confirms this."
Quantify uncertainty: Effect sizes need CIs or ranges. State population, comparator, and timeframe. For genetic associations: OR + CI + population + minor allele frequency (MAF).
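The OR + CI requirement can be made concrete with the standard Woolf (log) method on a 2×2 table. A minimal sketch; assumes no zero cells and uses z = 1.96 for a 95% CI:

```python
import math

def odds_ratio_ci(a: int, b: int, c: int, d: int, z: float = 1.96):
    """Odds ratio and CI from a 2x2 table:
    a = exposed cases, b = exposed controls,
    c = unexposed cases, d = unexposed controls.
    Woolf log method; assumes no zero cells.
    """
    or_ = (a * d) / (b * c)
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # SE of ln(OR)
    lo = math.exp(math.log(or_) - z * se)
    hi = math.exp(math.log(or_) + z * se)
    return or_, lo, hi
```

Reporting the interval (not just the point estimate) is what lets a reader see whether an association like OR 2.1 is tightly estimated or barely distinguishable from 1.0 — and the population and MAF still have to be stated alongside it.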
Genetic claims: Distinguish GWAS association vs functional validation vs clinical actionability. State penetrance. "Associated with" ≠ "causes." Single-SNP interpretation of polygenic traits is usually misleading. PGx claims need CPIC/DPWG level.
Dosing guardrails: Rx = guideline ranges only ("discuss with prescriber"). OTC/supplements = evidence-based ranges if cited. Genotype→dose only with CPIC/DPWG-level evidence, otherwise label INFERENCE.
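A minimal sketch of the gating rule above: genotype-informed dosing passes through only at CPIC/DPWG-grade evidence, and everything else gets the INFERENCE label. The level strings and function names here are illustrative, not an official API:

```python
# Illustrative evidence bar; real gating should read the actual
# CPIC/DPWG level from the source, not a hardcoded set.
ACTIONABLE_LEVELS = {"CPIC A", "CPIC B", "DPWG actionable"}

def dose_claim(gene: str, level: str, recommendation: str) -> str:
    """Emit a genotype->dose statement only at CPIC/DPWG-grade evidence;
    otherwise demote it to an explicitly labeled INFERENCE."""
    if level in ACTIONABLE_LEVELS:
        return f"{gene}: {recommendation} [{level}]"
    return (f"{gene}: INFERENCE (evidence level {level!r} below the "
            f"CPIC/DPWG bar) -- {recommendation}")
```

For Rx compounds the `recommendation` string should itself stay within guideline ranges and carry the "discuss with prescriber" framing.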
Grade every claim:
| Grade | Type | Notes |
|---|---|---|
| 1 | Clinical guideline / consensus | NICE, WHO, AAD, CPIC, DPWG |
| 2 | Systematic review / meta-analysis | Cochrane, PRISMA-compliant |
| 3 | Well-powered RCT | Pre-registered, independent, adequate N |
| 4 | Small / pilot RCT | Underpowered, often industry-funded |
| 5 | Large observational / cohort | Adjusted, replicated |
| 6 | GWAS / genetic association | Report OR, CI, population, replication |
| 7 | Animal model | Species, dose, route — note translatability |
| 8 | In vitro / cell culture | Note concentration vs physiological |
| 9 | Case report / expert opinion | Lowest weight |
Always note: conflicts of interest (COI), replication status, sample size, population match, effect size (NNT, ARR, or Cohen's d when available).
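One way to keep the grade and its required metadata attached to each claim is a small record type mirroring the table above. A sketch — the field names are illustrative, not a standard schema:

```python
from dataclasses import dataclass

@dataclass
class GradedClaim:
    """A single claim with its grade (1-9 per the hierarchy table)
    and the metadata the checklist requires."""
    text: str
    grade: int             # 1 = guideline .. 9 = case report / opinion
    citation: str          # DOI/PMID/NCT ID, or "UNCITED"
    effect_size: str = ""  # e.g. "OR 1.4 (95% CI 1.1-1.8)"
    population: str = ""
    coi_noted: bool = False
    replicated: bool = False

    def __post_init__(self):
        if not 1 <= self.grade <= 9:
            raise ValueError("grade must be 1-9 per the hierarchy table")
```

Forcing every claim through a structure like this makes missing fields (no citation, no population, no effect size) visible instead of silently dropped.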
You may reason from first principles, but MUST label it INFERENCE.
Any INFERENCE must include:
Three buckets in every output:
Before promoting any new concept into a system, ask:
If the answer is "no" or "mostly wording", do not promote it as a new runtime object. Keep it as memo-level guidance or merge it into an existing operator.
This matters most in genomics and phenotype-policy work, where good epistemic caveats can easily metastasize into a Rube Goldberg system if every caveat becomes a first-class type.
Check yourself against each before outputting:
| Model | Failure Mode | Severity | Notes |
|---|---|---|---|
| Claude (Opus 4.6) | Sycophantic hedging; agrees then qualifies until useless | Medium | Improved from 4.5 but still present |
| Claude | Citation-shaped bullshit; plausible references that don't exist | High | CoT unfaithfulness baseline: 7-13% on clean prompts (ICLR 2026) |
| Claude | Genotype determinism; treats associations as deterministic | High | |
| GPT (5.2–5.4) | Confident fabrication; invents complete fake studies with authors and N | Critical | Worse with extended thinking enabled. 5.4 improved (SimpleQA ~72%) but still rarely refuses — fabricates confidently |
| GPT | Overcitation; cites 20+ papers, many tangential or unverifiable | Medium | |
| Gemini (3.1 Pro) | Google-source bias; over-relies on Scholar snippets without reading papers | High | 1M context invites dumping entire papers without processing |
| Gemini | Length inflation; massive outputs that bury the signal | Medium | |
| All models | Implicit post-hoc rationalization; unfaithful CoT on clean prompts | Medium | 7-13% baseline rate (arXiv, ICLR 2026 submission). Not adversarial — happens on normal prompts |
Cross-model validation: For high-stakes bio claims (Grade 1-3 evidence affecting clinical decisions), route the same evidence through a second model as independent assessor. Different models have different fabrication patterns — Claude invents plausible-but-wrong citations, GPT invents complete fake studies. Cross-checking catches both.
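A hypothetical sketch of the routing step — `ask_model` stands in for whatever second-model client you use; no real API is assumed:

```python
# Cross-model validation sketch: the same claim and evidence go to an
# independent assessor model. `ask_model` is a placeholder callable
# (prompt -> response string), not a real library function.
def cross_check(claim: str, evidence: str, ask_model) -> dict:
    prompt = (
        "Independently assess this claim against the evidence provided. "
        "Do not assume the claim is correct.\n"
        f"Claim: {claim}\n"
        f"Evidence: {evidence}\n"
        "Answer SUPPORTED / UNSUPPORTED / FABRICATION-SUSPECTED, with reasons."
    )
    return {"claim": claim, "second_opinion": ask_model(prompt)}
```

The value comes from the assessor seeing the raw evidence rather than the first model's conclusion, so each model's distinct fabrication pattern gets an independent chance to be caught.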
Before grading evidence or writing conclusions, recite the key evidence items verbatim — restate the study name, N, effect size, and population for each Grade 1-5 source you're relying on. This combats lost-in-the-middle effects when working with many sources (Du et al., EMNLP 2025: +4% accuracy, training-free).
Don't summarize — recite. The act of restating forces attention back to the actual data before the synthesis step where hallucination risk is highest.
After any bio research output:
Justified: CPIC Level A/B, PharmGKB Level 1A/1B. Not justified (but LLMs do it anyway): single GWAS hit (OR < 2.0) → dose recommendation; nutrigenomic SNP → supplement dose; variant without replication in the user's ancestry.
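The "not justified" patterns above can be expressed as a red-flag check. A sketch — the OR < 2.0 threshold comes straight from the text, and the function name and parameters are illustrative:

```python
def pgx_red_flags(odds_ratio: float,
                  replicated_in_user_ancestry: bool,
                  has_cpic_or_pharmgkb_top_level: bool) -> list[str]:
    """Return the reasons a genotype->dose claim must be demoted to
    INFERENCE. CPIC A/B or PharmGKB 1A/1B clears the bar outright."""
    if has_cpic_or_pharmgkb_top_level:
        return []
    flags = []
    if odds_ratio < 2.0:
        flags.append("single GWAS hit with OR < 2.0 -> no dose recommendation")
    if not replicated_in_user_ancestry:
        flags.append("variant not replicated in user's ancestry")
    flags.append("below CPIC/DPWG/PharmGKB actionability bar -> label INFERENCE")
    return flags
```

Note the last flag fires for any claim without top-level curation, even a large, replicated association — matching the rule that only CPIC/DPWG/PharmGKB-level evidence justifies genotype→dose.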
Key databases: CPIC (cpicpgx.org), PharmGKB, ClinVar, DPWG, gnomAD.