Convert Word (.docx) documents to clean Markdown. Use when: converting docx to markdown, importing Word documents, extracting content from .docx files, migrating documentation from Word format.
Convert Microsoft Word (.docx) documents to well-structured Markdown files with image extraction.
.docx file to .md
pandoc must be installed. Check with:
command -v pandoc
If missing, install:
# macOS
brew install pandoc
# Ubuntu/Debian
sudo apt-get install pandoc
# Windows (winget)
winget install --id JohnMacFarlane.Pandoc
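The check and the per-platform install commands above can be combined into one guard. This is a minimal sketch, not part of the skill's scripts: `install_hint` and `require_pandoc` are hypothetical helper names, and the Linux branch assumes a Debian/Ubuntu system.

```shell
#!/usr/bin/env bash

# install_hint (hypothetical helper): maps an OS name, as printed by
# `uname -s`, to one of the install commands listed above.
install_hint() {
  case "$1" in
    Darwin) echo "brew install pandoc" ;;
    Linux) echo "sudo apt-get install pandoc" ;;  # assumes Debian/Ubuntu
    MINGW*|MSYS*|CYGWIN*) echo "winget install --id JohnMacFarlane.Pandoc" ;;
    *) echo "see https://pandoc.org/installing.html" ;;
  esac
}

# require_pandoc (hypothetical helper): succeeds if pandoc is on PATH;
# otherwise prints an install hint to stderr and returns non-zero.
require_pandoc() {
  if command -v pandoc >/dev/null 2>&1; then
    return 0
  fi
  echo "pandoc not found. Try: $(install_hint "$(uname -s)")" >&2
  return 1
}
```

Calling `require_pandoc || exit 1` before the conversion step would stop early with a readable message instead of failing partway through.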
Confirm the .docx file path with the user. If not provided, ask for it.
.md extension
images/ subdirectory next to the output file for extracted media
Execute the conversion script:
bash .github/skills/docx-to-markdown/scripts/convert.sh "<input.docx>" "<output.md>"
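The contents of `convert.sh` are not shown here, so the following is only a sketch of what an equivalent conversion could look like under stated assumptions. The helper names (`default_output`, `media_dir`, `convert_docx`) are hypothetical, and the pandoc options used are standard ones (`-f`, `--wrap`, `--extract-media`, `-o`), not necessarily the ones the script passes.

```shell
#!/usr/bin/env bash

# default_output (hypothetical helper): swaps a trailing .docx for .md.
default_output() {
  printf '%s\n' "${1%.docx}.md"
}

# media_dir (hypothetical helper): the images/ directory beside the output file.
media_dir() {
  printf '%s/images\n' "$(dirname "$1")"
}

# convert_docx (assumed equivalent of the conversion step, not the real script).
# Usage: convert_docx <input.docx> [output.md]
convert_docx() {
  local input="$1"
  local output="${2:-$(default_output "$1")}"
  if [ ! -f "$input" ]; then
    echo "input not found: $input" >&2
    return 1
  fi
  # pandoc infers Markdown output from the .md extension.
  # --extract-media saves embedded images under the given directory; for
  # .docx input pandoc typically places them in a media/ subfolder there,
  # and image links use the directory path exactly as it is passed.
  pandoc "$input" -f docx --wrap=none \
    --extract-media="$(media_dir "$output")" -o "$output"
}
```

Because image links are written using the path given to `--extract-media`, running the conversion from the output file's directory is one way to keep those links relative to the Markdown file.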
The script:
Extracts images into an images/ folder next to the output
After conversion, review and fix the output:
Ensure there is a single # top-level heading; adjust if pandoc produced multiple H1s
Verify that images are in the images/ directory and paths resolve
Remove conversion artifacts (\newpage, empty anchors, {.underline} attributes)
Open the .md file and confirm it renders correctly
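Part of this review can be scripted. The sketch below assumes the artifacts appear in pandoc's usual Markdown forms (`\newpage` on a line of its own, empty anchors written as `[]{#id}`, and underlined text written as `[text]{.underline}`); `clean_markdown` and `count_h1` are hypothetical helper names, and the result still needs a manual read-through.

```shell
#!/usr/bin/env bash

# clean_markdown (hypothetical helper): filters Markdown from stdin to stdout.
#   1. deletes lines that contain only \newpage
#   2. removes empty anchors of the form []{#some-id}
#   3. unwraps [text]{.underline} to plain text
clean_markdown() {
  sed -E \
    -e '/^\\newpage[[:space:]]*$/d' \
    -e 's/\[\]\{#[^}]*\}//g' \
    -e 's/\[([^]]*)\]\{\.underline\}/\1/g'
}

# count_h1 (hypothetical helper): counts lines on stdin that start with "# ".
# It does not skip fenced code blocks, so a shell comment inside a fence
# is counted as well.
count_h1() {
  grep -c '^# '
}
```

For example, `clean_markdown < output.md > output.clean.md` followed by `count_h1 < output.clean.md` gives a cleaned copy and a quick signal of whether more than one H1 remains.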