Convert a PDF file into a Markdown document with extracted images. Use this skill whenever the user wants to convert a PDF to Markdown, extract content from a PDF, pull images or diagrams out of a PDF, or turn a PDF into readable text with embedded visuals. Triggers on phrases like "pdf to markdown", "extract pdf", "convert pdf", "pdf images", "pdf content", "parse pdf", or any request to process or read a PDF file into a structured text format. Even if the user just says "I have a PDF I want to work with in Markdown" — use this skill.
This skill converts a PDF file into a Markdown document. It produces a .md file with text and image references side by side.

The conversion script is located at:
C:\Users\xursc\projects\M107\.claude\skills\pdf-to-markdown\scripts\extract-pdf-images.py

Run it with:
python "C:/Users/xursc/projects/M107/.claude/skills/pdf-to-markdown/scripts/extract-pdf-images.py" \
"<pdf_path>" \
--output "<output_dir>"
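The same invocation can be assembled from Python, which avoids shell quoting problems when paths contain spaces. This is a minimal sketch: the script path is the one given in this document, while `build_command` and `convert` are hypothetical helper names, and passing `--output .` explicitly is assumed to match the script's own default.

```python
import subprocess
import sys
from pathlib import Path

# Script path as given in this document; adjust it for your machine.
SCRIPT = Path(
    "C:/Users/xursc/projects/M107/.claude/skills/pdf-to-markdown/scripts/extract-pdf-images.py"
)


def build_command(pdf_path, output_dir="."):
    """Return the argument list for the converter (hypothetical helper)."""
    return [sys.executable, str(SCRIPT), str(pdf_path), "--output", str(output_dir)]


def convert(pdf_path, output_dir="."):
    """Run the converter; raises CalledProcessError on a non-zero exit code."""
    return subprocess.run(build_command(pdf_path, output_dir), check=True)
```

Passing the arguments as a list lets `subprocess.run` handle each path as a single argument, so no manual quoting is needed.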
Arguments:
- <pdf_path>: path to the source PDF (required)
- --output <output_dir>: folder where the .md and images are saved (default: current directory)

Suggest ./output or the same folder as the PDF if the user doesn't have a preference.

The script requires PyMuPDF and Pillow. If they're missing, install them:
pip install pymupdf pillow
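Whether the two dependencies are present can be checked before running the script. The sketch below assumes the usual import names (`fitz` for PyMuPDF, `PIL` for Pillow); `missing_packages` is a hypothetical helper, not part of the script.

```python
import importlib.util

# Import name -> pip package name. PyMuPDF has traditionally been imported as "fitz".
REQUIRED = {"fitz": "pymupdf", "PIL": "pillow"}


def missing_packages(required=REQUIRED):
    """Return the pip names of packages whose import name cannot be found."""
    return [
        pip_name
        for module, pip_name in required.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing; install with: pip install " + " ".join(missing))
    else:
        print("PyMuPDF and Pillow are available.")
```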
The script writes the .md file and an images folder into the output directory:
<output_dir>/
├── <pdf-name>.md              ← the Markdown file
└── <pdf-name>_images/         ← extracted images referenced in the .md
    ├── image_1.png
    ├── image_2.jpg
    └── vector_page_3.png      ← rendered pages with vector diagrams
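Given this layout, the expected output locations can be derived from the PDF name and the output directory, which is handy for telling the user where to look. A minimal sketch, assuming the naming shown in the tree above; `expected_outputs` is a hypothetical helper.

```python
from pathlib import Path


def expected_outputs(pdf_path, output_dir="."):
    """Return (markdown_file, images_dir) following the layout described above."""
    stem = Path(pdf_path).stem  # "thesis.pdf" -> "thesis"
    out = Path(output_dir)
    return out / f"{stem}.md", out / f"{stem}_images"
```

For example, `expected_outputs("thesis.pdf", "output")` yields the paths `output/thesis.md` and `output/thesis_images`.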
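If the dependencies are missing, run pip install pymupdf pillow first.

Example:
User: "Can you convert my thesis.pdf to Markdown?"
Ask: "Where is thesis.pdf? And where should I save the output?"
Run the script, then report where the result was saved, for example ./output/thesis.md.
Show the beginning of the .md as a preview.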
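The preview step could be as simple as reading the first lines of the generated file. This is a sketch only: the `preview` helper and the 20-line default are assumptions, not something the script provides.

```python
from pathlib import Path


def preview(md_path, max_lines=20):
    """Return the first max_lines lines of a Markdown file as one string."""
    with Path(md_path).open(encoding="utf-8") as handle:
        # zip stops after max_lines, so large files are not read in full.
        lines = [line.rstrip("\n") for _, line in zip(range(max_lines), handle)]
    return "\n".join(lines)
```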