Fetch articles from PubMed Central using NCBI APIs. Search journals, retrieve full text via OAI-PMH, batch harvest for RAG pipelines. No API key required.
Fetch full-text articles from PubMed Central using official NCBI APIs.
# Search a journal
node {baseDir}/scripts/pmc-harvest.js --search "J Stroke[journal]" --year 2025
# Fetch full text for a specific article
node {baseDir}/scripts/pmc-harvest.js --fetch PMC12345678
# Batch harvest from multiple journals
node {baseDir}/scripts/pmc-harvest.js --harvest journals.json --year 2025
# Test with known journals
node {baseDir}/scripts/pmc-harvest.js --test
| Flag | Description |
|---|---|
--search <query> | PMC search query (use journal[name] format) |
--year <year> | Filter by publication year |
--max <n> | Max results (default: 100) |
--fetch <pmcid> | Fetch full text for specific PMCID |
--harvest <file> | Batch harvest from JSON journal list |
--test | Run test with sample journals |
const pmc = require('{baseDir}/lib/api.js');
// Search
const { count, pmcids } = await pmc.searchJournal('"J Stroke"[journal]', { year: 2025 });
// Get summaries
const summaries = await pmc.getSummaries(pmcids);
// Fetch full text
const { available, xml, reason } = await pmc.fetchFullText('PMC12345678');
// Parse JATS XML
const { title, abstract, body } = pmc.parseJATS(xml);
// Fetch abstract only (lightweight)
const { title, abstract } = await pmc.fetchAbstract('PMC12345678');
const queries = {
'Stroke': '"Stroke"[journal]',
'Journal of Stroke': '"J Stroke"[journal]',
'Stroke & Vascular Neurology': '"Stroke Vasc Neurol"[journal]',
'European Stroke Journal': '"Eur Stroke J"[journal]',
'BMC Neurology': '"BMC Neurol"[journal]'
};
This skill wraps NCBI's official APIs:
https://eutils.ncbi.nlm.nih.gov/entrez/eutils
esearch.fcgi — Search PMCesummary.fcgi — Get article metadatahttps://pmc.ncbi.nlm.nih.gov/api/oai/v1/mh
GetRecord — Fetch full text XMLFull docs: https://www.ncbi.nlm.nih.gov/books/NBK25501/