Skill File

Pdf Translation

Name: Pdf Translation
Author: wayne930242

Use when processing PDF rulebooks, extracting raw markdown and image artifacts, or preparing source materials before `chapter-split`, `init-doc`, or translation setup.

wayne930242 | 7 stars | Mar 9, 2026

Occupation
Categories: Documents

Skill Content

PDF Translation Workflow

Overview

Convert a source PDF into extracted markdown and image artifacts ready for chapter planning and translation setup.

Core principle: Extract cleanly and verify raw artifacts before chapter planning starts.

The Process

Step 1: Extract PDF

Run:

uv run python scripts/extract_pdf.py data/pdfs/<filename>.pdf

Expected outputs in data/markdown/:

<name>.md
<name>_pages.md
images/<name>/
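The expected outputs above can be checked mechanically before moving on. The following is a minimal sketch, not part of the skill's own scripts: the helper names `expected_outputs` and `missing_outputs` are hypothetical, and it assumes that `<name>` is the PDF filename without its extension.

```python
from pathlib import Path


def expected_outputs(pdf_path, markdown_root="data/markdown"):
    """Return the artifact paths Step 1 lists for a given source PDF.

    Assumption: <name> is the PDF filename without its extension.
    """
    name = Path(pdf_path).stem
    root = Path(markdown_root)
    return [
        root / f"{name}.md",         # full extracted markdown
        root / f"{name}_pages.md",   # second markdown file listed in Step 1
        root / "images" / name,      # directory of extracted images
    ]


def missing_outputs(pdf_path, markdown_root="data/markdown"):
    """Return the expected artifacts that do not exist on disk."""
    return [p for p in expected_outputs(pdf_path, markdown_root) if not p.exists()]
```

Calling `missing_outputs("data/pdfs/rulebook.pdf")` after the extraction run returns an empty list when all three artifacts are present. A non-empty list names the paths to investigate before chapter planning starts. The sketch only checks existence; it does not inspect the content of the files.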

Step 2: Validate Raw Outputs

Validate:

Related Skills

Feishu Doc

Summarize

Nano Pdf

Diffs

Customs Trade Compliance

Nutrient Document Processing