Name: pdf-to-markdown
Author: dujeonglee


Script location: scripts/pdf_to_markdown.py

python scripts/pdf_to_markdown.py input.pdf output.md

# Extract images alongside the markdown
python scripts/pdf_to_markdown.py input.pdf output.md --images

# Convert only pages 1 through 5
python scripts/pdf_to_markdown.py input.pdf output.md --pages "1-5"

# Convert specific pages
python scripts/pdf_to_markdown.py input.pdf output.md --pages "1,3,7-10"

# Plain text extraction without heading detection
python scripts/pdf_to_markdown.py input.pdf output.md --no-formatting

# Combined options
python scripts/pdf_to_markdown.py input.pdf output.md --images --pages "1-20"
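
The `--pages` value mixes single pages and ranges, as in `"1,3,7-10"`. How the script parses it is not shown here, so the following is only a minimal sketch of one way such a spec could be expanded. It assumes 1-based page numbers and inclusive ranges, and `parse_page_spec` is a hypothetical helper, not a function taken from `pdf_to_markdown.py`.

```python
def parse_page_spec(spec: str) -> list[int]:
    """Expand a spec such as "1,3,7-10" into sorted, unique page numbers.

    Assumption: pages are 1-based and ranges are inclusive.
    """
    pages: set[int] = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate stray commas such as "1,,3"
        if "-" in part:
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
            if start > end:
                raise ValueError(f"invalid range: {part!r}")
            pages.update(range(start, end + 1))
        else:
            pages.add(int(part))
    if any(page < 1 for page in pages):
        raise ValueError("page numbers start at 1")
    return sorted(pages)


print(parse_page_spec("1,3,7-10"))  # [1, 3, 7, 8, 9, 10]
```

Expanding the spec before conversion makes it easy to check a request against the document's page count and to report how many pages will be converted.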

Locate the input: use the path the user provided, or search the current working directory for .pdf files.

Run the script via the Bash tool:

python .claude/skills/pdf-to-markdown/scripts/pdf_to_markdown.py <input.pdf> <output>.md [--images] [--pages "..."] [--no-formatting]

Place the output alongside the input file unless the user specifies a different location.

Report: show the output file path and the number of pages converted.
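
The workflow above can be sketched in Python as a pair of small helpers: one that lists candidate PDFs in a directory, and one that assembles the command line with the output placed next to the input by default. This is an illustrative sketch only. `find_pdfs` and `build_command` are hypothetical names, and the flags are limited to the ones listed above.

```python
from __future__ import annotations

from pathlib import Path

# Script path as given in the workflow above.
SCRIPT = ".claude/skills/pdf-to-markdown/scripts/pdf_to_markdown.py"


def find_pdfs(directory: Path) -> list[Path]:
    """Return the .pdf files directly inside `directory`, sorted by name."""
    return sorted(
        p for p in directory.iterdir()
        if p.is_file() and p.suffix.lower() == ".pdf"
    )


def build_command(
    input_pdf: Path,
    output_md: Path | None = None,
    images: bool = False,
    pages: str | None = None,
    no_formatting: bool = False,
) -> list[str]:
    """Assemble the argument list for the conversion script."""
    # Default: write the markdown file alongside the input PDF.
    output = output_md if output_md is not None else input_pdf.with_suffix(".md")
    cmd = ["python", SCRIPT, str(input_pdf), str(output)]
    if images:
        cmd.append("--images")
    if pages:
        cmd.extend(["--pages", pages])
    if no_formatting:
        cmd.append("--no-formatting")
    return cmd


print(build_command(Path("report.pdf"), images=True, pages="1-5"))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)`. Building it as a list rather than a single string avoids quoting problems with paths that contain spaces.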

| Situation | Recommendation |
| --- | --- |
| User doesn't mention images | Skip `--images` (text-only is faster and cleaner) |
| User wants figures or diagrams preserved | Use `--images` |
| User wants only specific pages | Use `--pages "1-5"` with the requested range |
| PDF has complex multi-column layout | Consider `--no-formatting` for cleaner raw text |
| User mentions a large PDF (100+ pages) | Suggest `--pages` to process in smaller batches |
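
For the large-PDF case, the batch ranges passed to `--pages` can be generated rather than typed by hand. The sketch below is one possible approach, not part of the skill itself: `page_batches` is a hypothetical helper, the batch size of 25 is an arbitrary default, and the total page count is assumed to be known in advance.

```python
def page_batches(total_pages: int, batch_size: int = 25) -> list[str]:
    """Split pages 1..total_pages into inclusive "start-end" range strings."""
    if total_pages < 1 or batch_size < 1:
        raise ValueError("total_pages and batch_size must be positive")
    batches = []
    for start in range(1, total_pages + 1, batch_size):
        # The last batch may be shorter than batch_size.
        end = min(start + batch_size - 1, total_pages)
        batches.append(f"{start}-{end}")
    return batches


print(page_batches(120, 50))  # ['1-50', '51-100', '101-120']
```

Each string can be supplied as the `--pages` value of a separate run, with a distinct output file per batch so that later runs do not overwrite earlier ones.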

| Error | Cause | Fix |
| --- | --- | --- |
| `Input file not found` | Wrong path | Verify the file path and confirm the filename |
| `Missing dependency — pymupdf` | PyMuPDF not installed | `pip install pymupdf` |
| `Warning: not a PDF` | Non-.pdf extension | Check whether the file is actually a PDF |
| Poor text extraction | Scanned/image-based PDF | The PDF may need OCR preprocessing (e.g., `ocrmypdf`) |
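
Several of these errors can be caught before the script is run. The following is a minimal pre-flight sketch using only the standard library. `preflight` is a hypothetical helper, its messages are illustrative rather than the script's own output, and the dependency check assumes PyMuPDF is importable as either `pymupdf` or `fitz`.

```python
import importlib.util
from pathlib import Path


def preflight(input_path: str) -> list[str]:
    """Return a list of problems found before conversion (empty if none)."""
    path = Path(input_path)
    if not path.is_file():
        # Nothing else can be checked without the file.
        return [f"input file not found: {input_path}"]

    problems = []
    if path.suffix.lower() != ".pdf":
        problems.append("extension is not .pdf")

    # PDF files begin with the marker "%PDF-".
    with path.open("rb") as handle:
        if handle.read(5) != b"%PDF-":
            problems.append("file does not start with %PDF- (may not be a PDF)")

    # PyMuPDF has been importable as `fitz`, and more recently as `pymupdf`.
    if all(importlib.util.find_spec(name) is None for name in ("pymupdf", "fitz")):
        problems.append("PyMuPDF not installed: pip install pymupdf")
    return problems


print(preflight("no_such_file.pdf"))
```

A scanned, image-only PDF passes all of these checks, so poor extraction still has to be judged from the converted output.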
