Use in Claude Code when a user asks to read, analyze, summarize, or extract from a heavyweight file such as PDF, DOCX, PPTX, XLSX, CSV, or TSV. Convert the file into markdown or CSV first with the bundled script, generate a lightweight index, and only spend model tokens on the compressed artifact.
Claude Code has the tools to convert files locally, so it should not waste context by reading heavyweight files raw.
scripts/convert_heavy_file.pypython scripts/convert_heavy_file.py /absolute/path/to/file.ext
uv run \
--with pdfplumber \
--with python-docx \
--with python-pptx \
--with openpyxl \
python scripts/convert_heavy_file.py /absolute/path/to/file.ext
index.md before reading any converted artifact.read_extracted_artifact: read the markdown or CSV and continueinstall_dependency_and_retry: install the missing deterministic dependency and reruncheap_model_or_stronger_converter: retry with a better converter or use a cheaper model only on the extracted artifactreferences/open-source-stack.md explains the tool choices and fallback strategy.Edit PDFs with natural-language instructions using the nano-pdf CLI.