Skill for creating new nutrient YAML files from MIC website content. Use this when extracting a nutrient from lpi.oregonstate.edu/mic. Also useful for enhancing existing nutrient entries.
Guide the creation of new nutrient YAML files in the MIC knowledge base. The MIC website (lpi.oregonstate.edu/mic) serves as the authoritative research source. This skill emphasizes evidence-based extraction with proper ontology grounding.
Nutrient files live under `kb/nutrients/`. This skill can also be consulted for ongoing curation of existing nutrients.
Determine the nutrient and its category:
- `vitamins/` - Water-soluble and fat-soluble vitamins
- `minerals/` - Essential mineral elements
- `dietary-factors/` - Other dietary factors (fiber, flavonoids, etc.)
- `food-beverages/` - Specific foods or beverages
Check if it already exists:
ls kb/nutrients/**/*.yaml
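Note that `**` only recurses in bash when `shopt -s globstar` is set; otherwise it behaves like a single `*`, which still matches the `kb/nutrients/{category}/{nutrient}.yaml` layout. As a minimal sketch of a more targeted check, the hypothetical helper below (`nutrient_exists` is not part of the repository) looks for one nutrient by file name, assuming that layout:

```shell
# Hypothetical helper, not part of the repository.
# Assumes files are laid out as kb/nutrients/{category}/{nutrient}.yaml.
# Prints the path of the first match and returns 0; returns 1 if none is found.
nutrient_exists() {
    nutrient="$1"
    root="${2:-kb/nutrients}"
    # head -n 1 keeps only the first match, in case the name appears in two categories
    match=$(find "$root" -type f -name "${nutrient}.yaml" 2>/dev/null | head -n 1)
    if [ -n "$match" ]; then
        echo "$match"
        return 0
    fi
    return 1
}
```

For example, `nutrient_exists biotin || echo "biotin not curated yet"` prints the message only when no `biotin.yaml` exists under any category.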
Update CURATION-PROGRESS.md to mark the nutrient as in progress:
- Change `[ ]` to `[~]` to indicate work has started
Example: Change this line:
- [ ] Folate (`kb/nutrients/vitamins/folate.yaml`)
To:
- [~] Folate (`kb/nutrients/vitamins/folate.yaml`) - in progress
This helps track curation status and prevents duplicate work.
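The checkbox change can also be scripted. The sketch below is one possible approach, not an existing project command: `mark_in_progress` is a hypothetical helper that assumes each entry follows the `- [ ] Name (path)` pattern shown in the Folate example and that the nutrient name contains no characters special to `sed` (such as `/` or `.`).

```shell
# Hypothetical helper, not part of the repository.
# Rewrites "- [ ] <Name> (...)" to "- [~] <Name> (...) - in progress".
# Assumes <Name> contains no sed metacharacters.
mark_in_progress() {
    name="$1"
    file="${2:-CURATION-PROGRESS.md}"
    tmpfile=$(mktemp) || return 1
    # Only unchecked entries match, so entries already marked [~] are left alone
    if sed "s/^- \[ ] \(${name} (.*\)\$/- [~] \1 - in progress/" "$file" > "$tmpfile"; then
        # Write back through cat so the original file keeps its permissions
        cat "$tmpfile" > "$file"
    fi
    rm -f "$tmpfile"
}
```

Calling `mark_in_progress Folate` from the repository root would update the Folate line and leave every other entry unchanged.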
Download the MIC page using the just fetch-mic-page command:
# Format: just fetch-mic-page {category}/{nutrient}
just fetch-mic-page vitamins/biotin
just fetch-mic-page minerals/calcium
just fetch-mic-page dietary-factors/lipoic-acid
This downloads the HTML to cache/mic-pages/{nutrient}.html.
Verify the download:
ls -la cache/mic-pages/biotin.html
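`ls -la` confirms the file exists but still succeeds on a zero-byte download. As a small sketch, the hypothetical helper below (`check_mic_page` is an illustrative name) derives the cached path from the same `{category}/{nutrient}` argument passed to `just fetch-mic-page` and fails if the file is missing or empty:

```shell
# Hypothetical helper, not part of the repository.
# Maps "vitamins/biotin" to cache/mic-pages/biotin.html and checks that
# the file exists and is non-empty.
check_mic_page() {
    slug="$1"                          # e.g. vitamins/biotin
    cache_dir="${2:-cache/mic-pages}"
    nutrient="${slug##*/}"             # strip the category prefix
    page="$cache_dir/$nutrient.html"
    if [ -s "$page" ]; then
        echo "$page"
    else
        echo "missing or empty: $page" >&2
        return 1
    fi
}
```

A possible use is `just fetch-mic-page vitamins/biotin && check_mic_page vitamins/biotin`, which stops early if the download produced nothing.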
IMPORTANT: Before adding evidence, extract the MIC reference number → PMID mapping:
# Get TSV mapping of ref# to PMID
just extract-refs cache/mic-pages/biotin.html
This outputs a TSV with columns: source, reference_number, pubmed_id, citation
Example output:
source reference_number pubmed_id citation
biotin.html 1 Zempleni J, Wijeratne SSK...
biotin.html 2 PMID:10357733 Mock DM. Biotin...
biotin.html 3 PMID:15992684 Zempleni J, Hassan YI...
Key points:
- Empty `pubmed_id` = book chapter or non-PMID source (use `mic_references` only)
- Non-empty `pubmed_id` = can fetch abstract and add PMID evidence
- `reference_number` corresponds to `mic_references` values in the YAML
Save the mapping for reference during curation:
just extract-refs cache/mic-pages/biotin.html > cache/refs/biotin-refs.tsv
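If `cache/refs/` does not exist yet, the redirect fails, so run `mkdir -p cache/refs` first. Once the mapping is saved, it can be split into references that have a PMID and those that do not. The two functions below are a hedged sketch rather than project commands: the names are hypothetical, and they assume the file is tab-separated with a header row and `pubmed_id` in the third column, as in the example output.

```shell
# Hypothetical helpers, not part of the repository.
# Assume a tab-separated file with a header row and columns:
# source, reference_number, pubmed_id, citation.

# Print "reference_number<TAB>pubmed_id" for rows that have a PMID.
refs_with_pmid() {
    awk -F '\t' 'NR > 1 && $3 != "" { print $2 "\t" $3 }' "$1"
}

# Print the reference numbers whose pubmed_id column is empty
# (book chapters and other non-PMID sources).
refs_without_pmid() {
    awk -F '\t' 'NR > 1 && $3 == "" { print $2 }' "$1"
}
```

Running `refs_without_pmid cache/refs/biotin-refs.tsv` lists the references that can only be cited through `mic_references`, while `refs_with_pmid` lists those eligible for PMID evidence.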
Create kb/nutrients/{category}/{nutrient}.yaml with the basic structure: