Medical Question Answering
Purpose
Answer medical and biomedical questions precisely, grounding each answer in evidence from structured datasets and clinical knowledge bases.
Key Datasets
- MedQuAD (abachaa/MedQuAD): 47,457 QA pairs from 12 NIH sources, including NCI, GARD, GHR, MedlinePlus, NIDDK, NHLBI, NICHD, NIA, NIAMS, NINDS, and NIDA
- PubMedQA (qiaojin/PubMedQA): research questions answered yes/no/maybe by reasoning over PubMed abstracts
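As a rough illustration of working with the PubMedQA entry above, the sketch below loads the dataset and tallies its answer labels. It assumes the Hugging Face `datasets` library is installed, that `qiaojin/PubMedQA` exposes a `pqa_labeled` configuration with a `train` split, and that each record stores its label in a `final_decision` field. The helper names `load_pubmedqa_labeled` and `decision_counts` are hypothetical.

```python
from collections import Counter


def load_pubmedqa_labeled():
    """Load the labeled PubMedQA subset (needs network access and `pip install datasets`)."""
    # Imported lazily so decision_counts() below works without the library installed.
    from datasets import load_dataset

    # Assumption: config "pqa_labeled" and split "train" exist under this dataset id.
    return load_dataset("qiaojin/PubMedQA", "pqa_labeled", split="train")


def decision_counts(records):
    """Count yes/no/maybe labels across PubMedQA-style records.

    Assumption: each record is a mapping with a "final_decision" key.
    """
    return Counter(record["final_decision"] for record in records)
```

Calling `decision_counts(load_pubmedqa_labeled())` would give the label distribution, which is a quick sanity check before using the data for evaluation.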
Protocol
- Parse the question: identify medical entities (diseases, drugs, genes, symptoms)
- Source identification: match the question type to the appropriate NIH source
- Evidence retrieval: search PubMed, clinical guidelines, and drug databases
- Answer synthesis: provide an answer with a confidence level and citations
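The four protocol steps can be sketched as a small pipeline. This is a minimal illustration under stated assumptions, not a reference implementation: the keyword lexicon, the entity-type routing table, the in-memory corpus, and the confidence thresholds are all hypothetical stand-ins for a biomedical entity recognizer, a real source router, and live searches of PubMed or drug databases.

```python
from __future__ import annotations

import re
from dataclasses import dataclass, field

# Hypothetical toy lexicon; a real system would use a biomedical NER model.
ENTITY_LEXICON = {
    "diabetes": "disease",
    "metformin": "drug",
    "brca1": "gene",
    "fatigue": "symptom",
}

# Hypothetical routing from entity type to a source named in this document.
SOURCE_BY_ENTITY_TYPE = {
    "disease": "MedlinePlus",
    "drug": "MedlinePlus",
    "gene": "GHR",
    "symptom": "MedlinePlus",
}


@dataclass
class Evidence:
    source: str    # which knowledge source the passage came from
    citation: str  # identifier to cite in the final answer
    text: str      # passage text


@dataclass
class Answer:
    text: str
    confidence: str  # "low", "moderate", or "high"
    citations: list[str] = field(default_factory=list)


def parse_question(question: str) -> dict[str, str]:
    """Step 1: map each known entity in the question to its type."""
    tokens = re.findall(r"[a-z0-9]+", question.lower())
    return {tok: ENTITY_LEXICON[tok] for tok in tokens if tok in ENTITY_LEXICON}


def identify_sources(entities: dict[str, str]) -> list[str]:
    """Step 2: choose sources based on the entity types found."""
    return sorted({SOURCE_BY_ENTITY_TYPE[kind] for kind in entities.values()})


def retrieve_evidence(entities, sources, corpus: list[Evidence]) -> list[Evidence]:
    """Step 3: keep passages from a chosen source that mention any entity."""
    return [
        ev for ev in corpus
        if ev.source in sources and any(name in ev.text.lower() for name in entities)
    ]


def synthesize_answer(evidence: list[Evidence]) -> Answer:
    """Step 4: combine passages and attach a confidence level and citations."""
    if not evidence:
        return Answer("No supporting evidence found.", "low")
    # Assumed heuristic: two or more supporting passages count as "high".
    confidence = "high" if len(evidence) >= 2 else "moderate"
    text = " ".join(ev.text for ev in evidence)
    return Answer(text, confidence, [ev.citation for ev in evidence])


def answer_question(question: str, corpus: list[Evidence]) -> Answer:
    """Run the four protocol steps in order."""
    entities = parse_question(question)
    sources = identify_sources(entities)
    evidence = retrieve_evidence(entities, sources, corpus)
    return synthesize_answer(evidence)
```

Keeping each step as its own function means the toy lexicon or corpus can be swapped for a real recognizer or search client without changing `answer_question`.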