Name: Mineru Ocr Cli
Author: neipor

Search skills.../

Mineru Ocr Cli | Skills Pool

# macOS — Universal binary (Intel + Apple Silicon)
curl -fsSL https://github.com/neipor/mineru-cli/releases/latest/download/mineru-universal-apple-darwin.tar.gz \
  | tar -xz --strip-components=1 -C /usr/local/bin/ mineru-*/mineru
chmod +x /usr/local/bin/mineru && mineru --version

# Linux x86_64  (static musl — works on Ubuntu, Debian, Alpine, RHEL, …)
curl -fsSL https://github.com/neipor/mineru-cli/releases/latest/download/mineru-x86_64-unknown-linux-musl.tar.gz \
  | tar -xz --strip-components=1 -C /usr/local/bin/ mineru-*/mineru
chmod +x /usr/local/bin/mineru && mineru --version

# Linux ARM64  (Raspberry Pi 4/5, AWS Graviton, Oracle Ampere, …)
curl -fsSL https://github.com/neipor/mineru-cli/releases/latest/download/mineru-aarch64-unknown-linux-musl.tar.gz \
  | tar -xz --strip-components=1 -C /usr/local/bin/ mineru-*/mineru
chmod +x /usr/local/bin/mineru && mineru --version

# Download and extract
$ver = (Invoke-RestMethod "https://api.github.com/repos/neipor/mineru-cli/releases/latest").tag_name
$url = "https://github.com/neipor/mineru-cli/releases/download/$ver/mineru-$ver-x86_64-pc-windows-msvc.zip"
Invoke-WebRequest -Uri $url -OutFile "$env:TEMP\mineru.zip"
Expand-Archive "$env:TEMP\mineru.zip" -DestinationPath "$env:TEMP\mineru-pkg" -Force

# Copy to a directory that is already on PATH (System32 alternative: %LOCALAPPDATA%\Programs\mineru)
$dest = "$env:LOCALAPPDATA\Programs\mineru"
New-Item -ItemType Directory -Force $dest | Out-Null
Copy-Item "$env:TEMP\mineru-pkg\*\mineru.exe" $dest

# Permanently add to user PATH (takes effect in new shells)
$currentPath = [Environment]::GetEnvironmentVariable("Path", "User")
if ($currentPath -notlike "*$dest*") {
    [Environment]::SetEnvironmentVariable("Path", "$currentPath;$dest", "User")
    Write-Host "Added $dest to PATH — restart your terminal."
}

# Verify
& "$dest\mineru.exe" --version

git clone https://github.com/neipor/mineru-cli.git
cd mineru-cli
cargo build --release

# Install to /usr/local/bin/ (system-wide)
sudo cp target/release/mineru /usr/local/bin/

# Or install to ~/.cargo/bin/ (user-level, already on PATH after `rustup` setup)
cargo install --path .

cargo install mineru-cli
# Binary is placed at ~/.cargo/bin/mineru

mineru --version   # should print: mineru x.y.z
mineru --help      # full option reference

# macOS / Linux — set once for the session
export HTTPS_PROXY=http://127.0.0.1:7890   # replace with your proxy port
mineru document.pdf

# Or inline for a single command
HTTPS_PROXY=http://127.0.0.1:7890 mineru document.pdf

# Windows PowerShell
$env:HTTPS_PROXY = "http://127.0.0.1:7890"
mineru document.pdf

export MINERU_SERVER_URL=https://your-mirror.example.com
mineru document.pdf

mineru document.pdf --server-url https://your-mirror.example.com

pip install mineru[full]
python -m mineru.cli.gradio_app --server-name 0.0.0.0 --server-port 7860
mineru document.pdf --server-url http://localhost:7860

mineru [OPTIONS] <FILES>...

Arguments:
  <FILES>...    One or more input files (globs work in most shells: *.pdf)

Options:
  -f, --format <FORMAT>
          Output format [default: markdown]
          Possible values: markdown, json, plain

  -p, --pages <N>
          Maximum pages to process per document [default: 20]
          The public MinerU Space enforces a hard 20-page limit.

      --ocr
          Force OCR mode. Use when native text extraction fails (scanned PDFs,
          image-only documents). Slightly slower but more thorough.

      --no-formulas
          Disable LaTeX formula recognition (faster for documents without math).

      --no-tables
          Disable table structure recognition.

  -l, --lang <LANG>
          OCR language hint [default: "ch (Chinese, English, Chinese Traditional)"]
          Common values: ch, en, fr, de, ja, ko, ar
          Affects OCR character set; does not restrict document content.

  -b, --backend <BACKEND>
          Processing backend [default: hybrid-auto-engine]
          Possible values:
            pipeline          — Traditional multi-model pipeline, hallucination-free
            vlm-auto-engine   — VLM-based, Chinese/English only, highest accuracy
            hybrid-auto-engine — Hybrid, best overall multi-language (default)

  -o, --output-dir <DIR>
          Save output Markdown (or JSON / txt) plus an images/ sub-folder here.
          When omitted, content goes to stdout and image refs are shown as tags.

      --embed-images
          Inline all images as base64 data URIs in the Markdown output.
          Useful for multimodal LLMs that accept self-contained Markdown.
          Ignored when --output-dir is set (images are always saved as files).

      --server-url <URL>
          Custom Gradio server base URL [env: MINERU_SERVER_URL]
          [default: https://opendatalab-mineru.hf.space]

  -q, --quiet
          Suppress all progress output. Only the document content is written to
          stdout — ideal for shell pipelines.

  -h, --help      Print help
  -V, --version   Print version

# Extract a PDF to stdout (Markdown)
mineru paper.pdf

# Extract only the first 5 pages
mineru report.pdf --pages 5

# Force OCR on a scanned document
mineru scan.pdf --ocr

# English-only document, disable formula/table detection for speed
mineru letter.pdf --lang en --no-formulas --no-tables

# Save Markdown + images to ./output/
mineru report.pdf -o ./output/

# Save as JSON (includes metadata block)
mineru report.pdf -f json -o ./output/

# Batch: convert all PDFs in current directory
mineru *.pdf -o ./extracted/

# Batch with progress suppressed (CI / scripting)
mineru *.pdf -o ./extracted/ -q

# Summarize with the llm CLI
mineru paper.pdf -q | llm "Summarize in 5 bullet points"

# Chat with a document via ollama
mineru doc.pdf -f plain -q | ollama run llama3 "What are the key findings?"

# Feed a PDF to GPT-4 via the openai CLI
mineru paper.pdf -q | openai api chat.completions.create \
  -m gpt-4o --message "$(cat)"

# Embed images for multimodal LLMs
mineru figure_heavy_paper.pdf --embed-images -q | llm -m gpt-4o "Describe the figures"

# VLM backend for highest Chinese accuracy
mineru chinese_report.pdf -b vlm-auto-engine -l ch

# Process a PPTX presentation
mineru slides.pptx -o ./slides_output/

# Via proxy (China mainland)
HTTPS_PROXY=http://127.0.0.1:7890 mineru document.pdf -o ./output/

# Self-hosted server
mineru document.pdf --server-url http://192.168.1.100:7860

<!-- source: paper.pdf | processed: 2026-04-03 14:26 UTC | backend: hybrid-auto-engine | pages: 3 -->

# Attention Is All You Need

## Abstract
The dominant sequence transduction models...

$$\text{Attention}(Q, K, V) = \text{softmax}\!\left(\frac{QK^T}{\sqrt{d_k}}\right)V$$

{
  "meta": {
    "source_file": "paper.pdf",
    "processed_at": "2026-04-03T14:26:00Z",
    "backend": "hybrid-auto-engine",
    "pages": 3,
    "ocr": false,
    "formula": true,
    "table": true,
    "language": "ch (Chinese, English, Chinese Traditional)"
  },
  "content": "# Attention Is All You Need\n\n...",
  "status_log": ["Preparing request...", "Processing on server (0.0s)", "Completed"]
}

output/
├── document.md            ← Markdown with relative image links
└── document/
    └── images/
        ├── abc123.jpg     ← Extracted images referenced from the .md
        └── def456.jpg

[🖼 Image 1: figure1.jpg (45 KB)]

Backend	Best for	Notes
`hybrid-auto-engine`	General use, multi-language	Default
`pipeline`	Hallucination-sensitive workloads	Traditional multi-model, no generative steps
`vlm-auto-engine`	Highest accuracy (Chinese / English)	VLM-based; slower but most precise

Constraint	Value
Max pages (public Space)	20 per document
Typical processing time	1–5 seconds / page on L40S GPU
API key required	None
Internet required	Yes (calls HuggingFace Space)
Max file size	~50 MB (Space limit)

Mineru Ocr Cli

What this skill does

Mineru Ocr Cli

What this skill does

Installation

Option A — One-liner install (macOS / Linux, recommended)

Option B — Windows (PowerShell)

Option C — Build from source (requires Rust 1.85+)

Option D — cargo install (crates.io)

Verify the installation

🌐 Network Environment

Workaround 1 — System proxy (simplest)

Workaround 2 — `MINERU_SERVER_URL` env var

Workaround 3 — Self-host MinerU (requires a CUDA GPU)

Full option reference

Usage examples

Basic extraction

Save to files

LLM pipe workflows

Advanced

Output formats

`markdown` (default) — LLM-optimised

`json` — structured with metadata

`plain` — stripped text

Output directory layout (when `-o` is used)

Backends

Limits and performance

How it works (internals)

Feishu Doc

Summarize

Nano Pdf

Diffs

Customs Trade Compliance

Nutrient Document Processing

Mineru Ocr Cli

What this skill does

Mineru Ocr Cli

What this skill does

Installation

Option A — One-liner install (macOS / Linux, recommended)

Option B — Windows (PowerShell)

Option C — Build from source (requires Rust 1.85+)

Option D — cargo install (crates.io)

Verify the installation

🌐 Network Environment

Workaround 1 — System proxy (simplest)

Workaround 2 — MINERU_SERVER_URL env var

Workaround 3 — Self-host MinerU (requires a CUDA GPU)

Full option reference

Usage examples

Basic extraction

Save to files

LLM pipe workflows

Advanced

Output formats

markdown (default) — LLM-optimised

json — structured with metadata

plain — stripped text

Output directory layout (when -o is used)

Backends

Limits and performance

How it works (internals)

Feishu Doc

Summarize

Nano Pdf

Diffs

Customs Trade Compliance

Nutrient Document Processing

Workaround 2 — `MINERU_SERVER_URL` env var

`markdown` (default) — LLM-optimised

`json` — structured with metadata

`plain` — stripped text

Output directory layout (when `-o` is used)