Name: All2md Read
Author: thomas-villani

스킬 검색.../

All2md Read | Skills Pool

# Specific pages
all2md document.pdf --pdf-pages "1-3,5,10-15"

# Table detection
all2md document.pdf --pdf-detect-tables

# OCR for scanned documents
all2md scanned.pdf --pdf-ocr-enabled --pdf-ocr-mode auto

# Combined
all2md document.pdf --pdf-pages "1-5" --pdf-detect-tables --pdf-ocr-enabled

# Preserve formatting hints
all2md report.docx --docx-preserve-formatting

# Extract comments and tracked changes
all2md report.docx --docx-extract-comments

# Extract title as heading
all2md page.html --html-extract-title

# Download and save images
all2md page.html --attachment-mode save --attachment-output-dir ./images

# Embed images as base64
all2md page.html --attachment-mode base64

# Include attachment content
all2md message.eml --eml-include-attachments

# Detect email chains
all2md thread.eml --eml-detect-chains

# Specific sheet
all2md data.xlsx --xlsx-sheet "Sheet2"

# Notebook with outputs
all2md notebook.ipynb --ipynb-include-outputs

# Extract by heading name
all2md document.pdf --extract "Introduction"

# Extract by heading index range
all2md document.pdf --extract "#:1-3"

# Show document outline / table of contents
all2md document.pdf --outline

# Convert entire directory recursively
all2md ./documents -r -o ./markdown

# Parallel processing
all2md ./documents -r --parallel 4 --output-dir ./converted

# Combine multiple files into one
all2md *.pdf --collate -o combined.md

from all2md import to_markdown

# From file path
markdown = to_markdown("document.pdf")

# With options
markdown = to_markdown("document.pdf", pages="1-3", flavor="gfm")

# From bytes
markdown = to_markdown(pdf_bytes)

# From stdin
import sys
markdown = to_markdown(sys.stdin.buffer)

from all2md import to_ast

# Parse to AST for programmatic access
doc = to_ast("document.pdf")

# Access document structure
for node in doc.children:
    print(node.type, node.text_content[:50])

from all2md import to_markdown
from all2md.options.pdf import PdfOptions
from all2md.options.markdown import MarkdownRendererOptions

pdf_opts = PdfOptions(pages="1-5", detect_tables=True)
md_opts = MarkdownRendererOptions(flavor="gfm")

markdown = to_markdown(
    "document.pdf",
    parser_options=pdf_opts,
    renderer_options=md_opts,
)

from all2md import to_markdown

# Apply transforms during conversion
markdown = to_markdown(
    "document.pdf",
    transforms=["remove-images", "heading-offset"],
)

All2md Read

Reading Documents with all2md

Overview

CLI Quick Reference

Basic Usage

PDF Options

All2md Read

Reading Documents with all2md

Overview

CLI Quick Reference

Basic Usage

PDF Options

DOCX Options

HTML Options

Email Options

Excel and Notebooks

Section Extraction

Batch Processing

Python API

Simple Conversion

AST Access

With Parser Options

With Transforms

Supported Input Formats

Tips

Feishu Doc

Summarize

Nano Pdf

Diffs

Customs Trade Compliance

Nutrient Document Processing