PMC-OAI API Guide

Overview

PubMed Central (PMC) is a free full-text archive of biomedical and life sciences journal literature at the U.S. National Institutes of Health's National Library of Medicine (NIH/NLM). The PMC OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting) service provides a standardized interface for systematically harvesting metadata and full-text content from the PMC archive.

The OAI-PMH protocol is an internationally recognized standard for metadata harvesting, widely used by libraries, repositories, and research infrastructure. The PMC implementation allows researchers to programmatically discover and retrieve article metadata, including titles, authors, abstracts, MeSH terms, publication dates, and links to full-text XML and PDF versions. This is particularly valuable for building local search indexes, systematic review pipelines, and text mining corpora.

Biomedical researchers, systematic reviewers, bioinformaticians, medical librarians, and text mining specialists use the PMC OAI-PMH service to harvest large collections of open-access biomedical literature for meta-analyses, natural language processing research, knowledge graph construction, and institutional repository enrichment. PMC contains over 9 million full-text articles, making it one of the largest open-access biomedical literature collections in the world.

Parameter	Type	Required	Description
verb	string	Yes	Must be `ListRecords`
metadataPrefix	string	Yes	Metadata format: `oai_dc`, `pmc`, or `pmc_fm`
set	string	No	Filter by set (e.g., journal, open access subset)
from	string	No	Start date for selective harvesting (YYYY-MM-DD)
until	string	No	End date for selective harvesting (YYYY-MM-DD)
resumptionToken	string	No	Token for paginating through large result sets

Parameter	Type	Required	Description
verb	string	Yes	Must be `GetRecord`
identifier	string	Yes	OAI identifier (e.g., `oai:pubmedcentral.nih.gov:1234567`)
metadataPrefix	string	Yes	Metadata format: `oai_dc`, `pmc`, or `pmc_fm`

Pmc Oai Api

Pmc Oai Api

PMC-OAI API Guide

Overview

Authentication

Core Endpoints

ListRecords: Harvest Article Metadata

GetRecord: Retrieve a Single Record

ListSets: Discover Available Sets

Rate Limits

Common Patterns

Incremental Metadata Harvesting

Build a Local Search Index

Discover Available Journal Sets

References

Deep Research

Data Analyst

Academic Researcher

Data Scientist

Biopython

Binary Analysis Patterns