Name: Paddleocr Doc Parsing
Author: PaddlePaddle

Execute document parsing:
```
python scripts/vl_caller.py --file-url "URL provided by user" --pretty
```
Or for local files:
```
python scripts/vl_caller.py --file-path "file path" --pretty
```
Optional: explicitly set file type:
```
python scripts/vl_caller.py --file-url "URL provided by user" --file-type 0 --pretty
```
- --file-type 0: PDF
- --file-type 1: image
- If omitted, the service can infer file type from input.
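The flag combinations above can also be assembled programmatically. The following is a minimal sketch, not part of the skill itself: `build_vl_caller_args` is a hypothetical helper, it uses only the `--file-url`, `--file-path`, `--file-type`, and `--pretty` flags described here, and the rule that exactly one input source is passed is an assumption.

```python
from typing import List, Optional

# File-type codes as documented above: 0 = PDF, 1 = image.
FILE_TYPES = {0: "PDF", 1: "image"}


def build_vl_caller_args(
    file_url: Optional[str] = None,
    file_path: Optional[str] = None,
    file_type: Optional[int] = None,
    pretty: bool = True,
) -> List[str]:
    """Build an argv list for scripts/vl_caller.py (hypothetical helper)."""
    # Assumption: exactly one input source is given per call.
    if (file_url is None) == (file_path is None):
        raise ValueError("pass exactly one of file_url or file_path")
    if file_type is not None and file_type not in FILE_TYPES:
        raise ValueError("file_type must be 0 (PDF) or 1 (image)")

    args = ["python", "scripts/vl_caller.py"]
    if file_url is not None:
        args += ["--file-url", file_url]
    else:
        args += ["--file-path", file_path]
    # Omitting --file-type leaves type inference to the service.
    if file_type is not None:
        args += ["--file-type", str(file_type)]
    if pretty:
        args.append("--pretty")
    return args
```

The returned list can be passed to `subprocess.run(args, capture_output=True, text=True)`, which avoids shell quoting problems with user-supplied URLs and paths.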
Default behavior: save raw JSON to a temp file:
- If --output is omitted, the script saves automatically under the system temp directory
- Default path pattern: <system-temp>/paddleocr/doc-parsing/results/result_<timestamp>_<id>.json
- If --output is provided, it overrides the default temp-file destination
- If --stdout is provided, JSON is printed to stdout and no file is saved
- In save mode, the script prints the absolute saved path on stderr: Result saved to: /absolute/path/...
- In default/custom save mode, read and parse the saved JSON file before responding
- In save mode, always tell the user the saved file path and that full raw JSON is available there
- Use --stdout only when you explicitly want to skip file persistence
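In save mode, the saved path has to be recovered from stderr before the JSON can be read. A minimal sketch of that step follows, assuming the `Result saved to: ` prefix shown above appears at the start of a line; `find_saved_path` and `load_saved_result` are hypothetical helper names, not functions provided by the skill.

```python
import json
import re
from pathlib import Path
from typing import Optional

# Matches the stderr line documented above: "Result saved to: /absolute/path/..."
SAVED_PATH_RE = re.compile(r"^Result saved to: (.+)$", re.MULTILINE)


def find_saved_path(stderr_text: str) -> Optional[Path]:
    """Return the saved result path reported on stderr, or None if absent."""
    match = SAVED_PATH_RE.search(stderr_text)
    return Path(match.group(1).strip()) if match else None


def load_saved_result(stderr_text: str) -> dict:
    """Read and parse the saved JSON file named in the stderr output."""
    path = find_saved_path(stderr_text)
    if path is None:
        raise ValueError("no 'Result saved to:' line found in stderr")
    with path.open(encoding="utf-8") as handle:
        return json.load(handle)
```

Keeping the path around after loading makes it easy to report it to the user, as the workflow above requires.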
The output JSON contains COMPLETE content with all document data:
- Headers, footers, page numbers
- Main text content
- Tables with structure
- Formulas (with LaTeX)
- Figures and charts
- Footnotes and references
- Seals and stamps
- Layout and reading order
Input type note:
- Supported file types depend on the model and endpoint configuration.
- Always follow the file type constraints documented by your endpoint API.
Extract what the user needs from the output JSON using these fields:
- Top-level text
- result[n].markdown
- result[n].prunedResult

Always display the complete extracted content to the user.

Correct approach:
User: "Extract all the text from this document"
Agent: I've parsed the complete document. Here's all the extracted text:

[Display entire text field or concatenated regions in reading order]

Document Statistics:
- Total regions: 25
- Text blocks: 15
- Tables: 3
- Formulas: 2
Quality: Excellent (confidence: 0.92)

Incorrect approach (content is truncated):
User: "Extract all the text"
Agent: "I found a document with multiple sections. Here's the beginning:
'Introduction...' (content truncated for brevity)"

The script wraps the provider response in a JSON envelope of this shape:
```
{
  "ok": true,
  "text": "Full markdown/HTML text extracted from all pages",
  "result": { ... },  // raw provider response
  "error": null
}
```
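A caller typically checks `ok` before reading `text`. The sketch below illustrates that check against the envelope fields shown above; `text_from_envelope` is a hypothetical helper, and because the shape of a non-null `error` value is not documented here, it is treated as an opaque value.

```python
from typing import Any, Mapping


def text_from_envelope(envelope: Mapping[str, Any]) -> str:
    """Return the top-level text of a parsed result, or raise on failure."""
    if not envelope.get("ok"):
        # The structure of `error` is not specified, so it is only echoed back.
        raise RuntimeError(f"document parsing failed: {envelope.get('error')!r}")
    # Fall back to an empty string if the text field is missing or null.
    return envelope.get("text") or ""
```

Raising on `ok: false` keeps a failed call from being mistaken for an empty document.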

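Parse a document from a URL:
```
python scripts/vl_caller.py \
  --file-url "https://example.com/paper.pdf" \
  --pretty
```
Parse a local file:
```
python scripts/vl_caller.py \
  --file-path "./financial_report.pdf" \
  --pretty
```
Print the JSON to stdout instead of saving a file:
```
python scripts/vl_caller.py \
  --file-url "URL" \
  --stdout \
  --pretty
```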

If the API is not configured, the script reports this error:
```
CONFIG_ERROR: PADDLEOCR_DOC_PARSING_API_URL not configured. Get your API at: https://paddleocr.com
```

Show the exact error message to the user (including the URL).
Guide the user to configure securely:
- Recommend configuring through the host application's standard method (e.g., settings file, environment variable UI) rather than pasting credentials in chat.
- List the required environment variables:
```
- PADDLEOCR_DOC_PARSING_API_URL
- PADDLEOCR_ACCESS_TOKEN
- Optional: PADDLEOCR_DOC_PARSING_TIMEOUT
```
If the user provides credentials in chat anyway (accept any reasonable format), for example:
- PADDLEOCR_DOC_PARSING_API_URL=https://xxx.paddleocr.com/layout-parsing, PADDLEOCR_ACCESS_TOKEN=abc123...
- Here's my API: https://xxx and token: abc123
- Copy-pasted code format
- Any other reasonable format
- Security note: Warn the user that credentials shared in chat may be stored in conversation history. Recommend setting them through the host application's configuration instead when possible.
Then parse and validate the values:
- Extract PADDLEOCR_DOC_PARSING_API_URL (look for URLs with paddleocr.com or similar)
- Confirm PADDLEOCR_DOC_PARSING_API_URL is a full endpoint ending with /layout-parsing
- Extract PADDLEOCR_ACCESS_TOKEN (long alphanumeric string, usually 40+ chars)
Ask the user to confirm the environment is configured.
Retry only after confirmation:
- Once the user confirms the environment variables are available, retry the original parsing task
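Before retrying, the environment can be checked against the rules listed above. This is a minimal sketch under stated assumptions: `config_problems` is a hypothetical helper, the URL check only enforces the documented `/layout-parsing` suffix, and the token length is not enforced because the 40+ character figure is described as typical rather than required. The optional `PADDLEOCR_DOC_PARSING_TIMEOUT` is not checked.

```python
import os
from typing import List, Mapping

REQUIRED_VARS = ("PADDLEOCR_DOC_PARSING_API_URL", "PADDLEOCR_ACCESS_TOKEN")


def config_problems(env: Mapping[str, str]) -> List[str]:
    """Return human-readable configuration problems (empty list if none)."""
    problems = []
    for name in REQUIRED_VARS:
        if not env.get(name, "").strip():
            problems.append(f"{name} is not set")

    url = env.get("PADDLEOCR_DOC_PARSING_API_URL", "").strip()
    # The endpoint must be the full URL, ending with /layout-parsing.
    if url and not url.endswith("/layout-parsing"):
        problems.append(
            "PADDLEOCR_DOC_PARSING_API_URL must be a full endpoint "
            "ending with /layout-parsing"
        )
    return problems


if __name__ == "__main__":
    # Report problems by variable name only; never print the token itself.
    for problem in config_problems(os.environ):
        print(problem)
```

Reporting only variable names keeps the access token out of logs and conversation history.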

For large local files, the recommended approach is to host the file and pass its URL:
```
python scripts/vl_caller.py --file-url "https://your-server.com/large_file.pdf"
```
To process only specific pages (PDF only), split the file first:
```
# Extract pages 1-5
python scripts/split_pdf.py large.pdf pages_1_5.pdf --pages "1-5"

# Mixed ranges are supported
python scripts/split_pdf.py large.pdf selected_pages.pdf --pages "1-5,8,10-12"

# Then process the smaller file
python scripts/vl_caller.py --file-path "pages_1_5.pdf"
```
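To make the `--pages` syntax concrete, the sketch below expands a specification such as `"1-5,8,10-12"` into individual page numbers. It illustrates the format only and is not the implementation used by `split_pdf.py`; `parse_page_spec` is a hypothetical name, and the handling of malformed input is an assumption.

```python
from typing import List


def parse_page_spec(spec: str) -> List[int]:
    """Expand a spec like "1-5,8,10-12" into a list of 1-based page numbers."""
    pages: List[int] = []
    for part in spec.split(","):
        part = part.strip()
        if "-" in part:
            # A range such as "10-12" includes both endpoints.
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
        else:
            # A single page such as "8"; int() raises ValueError on bad input.
            start = end = int(part)
        if start < 1 or end < start:
            raise ValueError(f"invalid page range: {part!r}")
        pages.extend(range(start, end + 1))
    return pages
```

For example, `parse_page_spec("1-5,8,10-12")` yields pages 1 through 5, page 8, and pages 10 through 12, in that order.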

