Use the dedicated document_parse tool as the default interface. Do not fall back to manual lit CLI commands unless the user explicitly asks for the raw command line workflow or the extension tool is unavailable.

Efficient tool usage

Choose the smallest useful output

Use format: "text" when the user wants to read, summarize, quote, search, or review the document.
Use format: "json" when the user needs structured page data, text positions, or bounding boxes.
Avoid asking for JSON unless coordinates or programmatic structure actually matter.

Limit scope early

If the task concerns only part of a document, pass targetPages instead of parsing everything.

Examples:

a single chapter
a cited appendix
a page range from the user
a specific page mentioned in an error report or screenshot request

Use OCR deliberately

Efficient tool usage

Choose the smallest useful output

Use format: "text" when the user wants to read, summarize, quote, search, or review the document.
Use format: "json" when the user needs structured page data, text positions, or bounding boxes.
Avoid asking for JSON unless coordinates or programmatic structure actually matter.

Limit scope early

If the task concerns only part of a document, pass targetPages instead of parsing everything.

Examples:

a single chapter
a cited appendix
a page range from the user
a specific page mentioned in an error report or screenshot request

Parse Document

Efficient tool usage

Choose the smallest useful output

Limit scope early

Use OCR deliberately

Parse Document

Efficient tool usage

Choose the smallest useful output

Limit scope early

Use OCR deliberately

Use screenshots only when text is not enough

Follow-up workflow

Important constraints and expectations

Good default patterns

Summarize or review a document

Extract positions or bounding boxes

Review a visually complex page

Parse a scanned or image-based document

Parameter reminders

Feishu Doc

Summarize

Nano Pdf

Diffs

Customs Trade Compliance

Nutrient Document Processing