Name: Document Review
Author: kortix-ai

Buscar habilidades.../

issue_type: One of the five types defined below (spelling_grammar, narrative_logic, non_public_info, verify_public_data, numerical_consistency)
severity: high, medium, or low (see Severity Levels below)
original_text: The exact verbatim text from the document that contains the issue. This is the text you would highlight to show a reader where the problem is.
description: A concise explanation of why original_text is wrong or problematic. Write as 1-2 direct sentences that a reviewer can scan without re-reading. Lead with what's wrong, then state what's correct or expected. Avoid filler ("It appears that...", "There seems to be...") — state the problem directly. Examples: "Revenue is stated as $28T but publicly reported figures show $25.5T for 2023." / "The word 'acheived' is misspelled." / "Page 3 says 'founded in 2015' but page 7 says 'celebrating 10 years' in 2023, which implies 2013."
text_context: A longer passage containing original_text with surrounding text before and after. Used to disambiguate when original_text appears multiple times in the document.
new_text: The suggested replacement for original_text. Should closely resemble the original but with the issue fixed.
section: The document section name the issue is in.
location: Where the issue is located — always a string. For PDF/DOCX/PPTX: the page or slide number (e.g., "3"). For XLSX: the sheet name (e.g., "Revenue").
anchor: Precise position within the location, or null. For PDF/DOCX/PPTX: a descriptive label (e.g., "paragraph 5", "Risk Factors heading", "Table 2, row 3"). For XLSX: a cell reference (e.g., "B14"). Be specific — use "paragraph 5" not "5".
issue_id: Unique identifier in the form issue:<N> where N is a positive integer (e.g., issue:1, issue:2).
root_issue_id: The issue_id of an earlier issue that caused this one. Use when an error cascades — e.g., page 1 says "U.S. population is 200 million" (wrong, issue:1), then page 2 says "10% of the U.S. population is 20 million" — the math is correct for the stated figure but based on the wrong number from issue:1, so sets to . Omit when the issue is independent.

spelling_grammar: Misspelled words and grammatical errors.
- IS: "acheived" → "achieved" (misspelling)
- IS: "A total of $100 millions dollars" (grammar: "millions" → "million")
- IS: "the the report" (repeated word)
- IS: "Their going to expand" ("Their" → "They're")
- NOT: Wrong dates, wrong numbers, backwards timelines (→ narrative_logic)
- NOT: Redundant or nonsensical titles/headings (→ narrative_logic)
- NOT: Real names used instead of code names (→ non_public_info)
- Test: Could a spell-checker or grammar-checker catch this? If yes → spelling_grammar. If it requires understanding meaning or context → it's another type.
narrative_logic: Logical errors, contradictions, timeline problems, nonsensical content, or structural issues that require understanding meaning to detect.
- IS: "Founded in 2015" on page 3 but "celebrating 10 years" in 2023 (contradiction)
- IS: "Expected to grow from 500 in 2025 to 600 in 2020" (backwards timeline)
- IS: "Comparable Comps Analysis" (tautological — "Comparable" and "Comps" mean the same thing)
- IS: "Fitness Industry Page" as a section title ("Page" is redundant — the audience already knows it's a page)
- IS: Document dated January but references "Q4 results" as if complete (impossible timeline)
- IS: A section header that contradicts the document's own conventions or makes no sense
- NOT: Misspelled words (→ spelling_grammar)
- NOT: Confidential information leaks (→ non_public_info)
- Test: Does this require understanding what the words mean (dates, logic, context) to spot the error? If yes → narrative_logic.
non_public_info: Confidential information that should not appear in the document.
- IS: Using real company names instead of code names (e.g., "Planet Fitness" when code name is "Pluto")
- IS: Individual salaries, SSNs, private email addresses, phone numbers
- IS: Unreleased product details, internal strategies, board deliberations
- IS: "STRICTLY CONFIDENTIAL" or "INTERNAL USE ONLY" markers left in a public-facing document
- NOT: Misspelled names (→ spelling_grammar)
- NOT: Contradictions about confidential info (→ narrative_logic)
- Test: Is the problem that confidential or private information is exposed? If yes → .

session_start_background(
  project="<project-name-if-applicable>",
  prompt="Load the document-review skill and execute its workflow to review the attached document. Issue types: [list the relevant issue types]."
)

Turn 4: web_search("Acme Corp 2023 annual report") + web_search("U.S. census 2023") + bash(calc claim:5) + bash(calc claim:6)
         → broad searches verify claims 1-3 (revenue, headcount, founding date) + claim 4 (population) + claims 5-6 (math)
Turn 5: update-claims [1-6] + web_search("Acme Corp acquisition history") + bash(calc claim:9)
         → narrow search for claims 7-8 that broad searches missed
Turn 6: update-claims [7-9] + web_search("specific fact still unverified")

PDF/PPTX/XLSX — Single command; the script reads issues from document_review_state.json automatically:
- python skills/document-review/scripts/annotate_pdf.py input.pdf {base_name}_reviewed.pdf
- python skills/document-review/scripts/annotate_pptx.py input.pptx {base_name}_reviewed.pptx
- python skills/document-review/scripts/annotate_xlsx.py input.xlsx {base_name}_reviewed.xlsx
DOCX — read skills/GENERAL-KNOWLEDGE-WORKER/docx/SKILL.md`` and follow its workflow to unpack, edit XML, and repack. Use manage_state.py get-issues to list all issues, then for each issue:
1. Add a comment using comment.py --author "Kortix", then insert <w:commentRangeStart>, <w:commentRangeEnd>, and <w:commentReference> markers in document.xml. Place <w:commentRangeStart> immediately before the first <w:r> that contains the original_text, and <w:commentRangeEnd> immediately after the last <w:r> that contains it — do NOT place these at the paragraph or body level.
  
  Comment text format — every comment MUST use this exact single-line format. Do NOT use \n or line breaks — comment.py renders them as literal text, not actual breaks. No "Suggested" line — the tracked change shows the fix.
```
[Type Label | severity] description
```
  Type label mapping: spelling_grammar → Spelling/Grammar, narrative_logic → Narrative/Logic, non_public_info → Non-Public Info, verify_public_data → Public Data, numerical_consistency → Numerical Consistency.
  
  Example comment.py call for a spelling issue:
```
python skills/GENERAL-KNOWLEDGE-WORKER/docx/scripts/comment.py unpacked/ 0 "[Spelling/Grammar | low] The word 'acheived' is misspelled." --author "Kortix"
```
  Example for a numerical consistency issue:

Setup:
  manage_state.py init "Report.pdf"
  (if applicable) read specialization file for domain-specific guidance

Phase 1: Create sections
  Step 1: manage_state.py add-sections --data '[...]' → 8 sections

Phase 2: Create claims
  Step 2: add-claims "Section 1" + add-claims "Section 2" + add-claims "Section 3" + add-claims "Section 4" (4 parallel)
  Step 3: add-claims "Section 5" + add-claims "Section 6" + add-claims "Section 7" + add-claims "Section 8" (4 parallel)
  → 49 claims created

Phase 3: Update claims
  Step 4: get-claims --status unverified + web_search for facts + bash for calculations (parallel)
  Step 5: update-claims [1-15] + web_search(claims 16-30) (parallel)
  Step 6: update-claims [16-30] + bash(calculations 31-45) (parallel)
  Step 7: update-claims [31-45] + final searches (parallel)
  Step 8: update-claims [46-49]
  → All 49 claims updated

Phase 4: Create issues
  Steps 9-16: manage_state.py add-issues "Section N" --data '[...]' (sequential, one per step)
  → Issues created for all problems

Phase 5: Annotate document
  Step 17: python skills/document-review/scripts/annotate_pdf.py Report.pdf Report_reviewed.pdf
  → Annotated document saved to workspace

Phase 6: Submit review
  Step 18: manage_state.py submit "Review summary"

issue:2

root_issue_id

issue:1

non_public_info

python skills/GENERAL-KNOWLEDGE-WORKER/docx/scripts/comment.py unpacked/ 1 "[Numerical Consistency | high] Region totals sum to 800, not the stated 900." --author "Kortix"

<w:commentRangeStart w:id="0"/>
<w:del w:id="1" w:author="Kortix" w:date="{timestamp}">
  <w:r><w:rPr><!-- original formatting --></w:rPr><w:delText>original text</w:delText></w:r>
</w:del>
<w:ins w:id="2" w:author="Kortix" w:date="{timestamp}">
  <w:r><w:rPr><!-- original formatting --></w:rPr><w:t>new text</w:t></w:r>
</w:ins>
<w:commentRangeEnd w:id="0"/>
<w:r><w:rPr><w:rStyle w:val="CommentReference"/></w:rPr><w:commentReference w:id="0"/></w:r>

Document Review | Skills Pool

Location	Type	Issue	Finding
p. 3	Public Data	Revenue stated as $28T	Publicly reported figure is $25.5T for 2023
p. 7	Numerical	Region totals sum to 900	Correct sum is 800 (500 + 300)

Document Review

Document Review

Ground Rules

Definitions

Sections

Claims

Claim Types

Claim Statuses

Issues

Issue Types

Severity Levels

Background Session Setup

Document Type and Specializations

Workflow

Processing Strategy

Phase 1 (Create sections)

Phase 2 (Create claims)

Phase 3 (Update claims)

Phase 4 (Create issues)

Phase 5 (Annotate document)

Phase 6 (Submit review)

Example Workflow

Output

Feishu Doc

Summarize

Nano Pdf

Diffs

Customs Trade Compliance

Nutrient Document Processing