This skill extracts construction PDF plan binders into agent-consumable formats. It should be used when a contractor or homeowner provides a PDF binder of construction plans (site plans, floor plans, structural drawings, Title 24 reports) that needs to be parsed for permit review, corrections response, or plan check analysis. Produces three outputs: page PNGs for vision analysis, structured markdown per page via vision extraction, and a JSON manifest for routing.
Extract multi-page construction plan PDF binders into a vision-first structure that enables an AI agent to efficiently navigate, reference, and respond to specific pages and drawing zones within the plans.
Construction PDFs are uniquely challenging because:

- Drawing sheets defeat text extraction: vector-heavy pages yield garbage or reversed text
- Some pages (e.g., Title 24 reports) are fully rasterized, so text-layer tools return little or nothing
- Meaning is spatial: title blocks, detail callouts, and schedules only make sense with layout intact

Invoke this skill when:

- A contractor or homeowner provides a PDF binder of construction plans (site plans, floor plans, structural drawings, Title 24 reports)
- The plans need to be parsed for permit review, corrections response, or plan check analysis
Four text extraction methods were tested head-to-head on real construction PDFs.
Vision wins on every page type for structure and layout. See
references/extraction-findings.md for the full comparison data.
| Method | Drawing Pages | Text-Heavy Pages | Rasterized (Title 24) |
|---|---|---|---|
| pdftotext | Garbage | Usable | Empty |
| pdfplumber | Reversed text | Good | 367 chars |
| Tesseract OCR | Garbled | Good | Good |
| Claude Vision | Excellent | Excellent | Excellent |
Vision is the primary extraction method. It handles structure, spatial understanding, watermark transparency, drawing interpretation, and rasterized content reading. No other method comes close on construction PDFs.
Tesseract supplements vision for numeric accuracy. Testing on dense cover sheets revealed that vision at 1568px resolution can hallucinate specific numeric values — "65.0 sq ft" becomes "856", "475 sq ft" gets missed entirely. Tesseract's character-level OCR reliably captures exact digits even on dense pages. The hybrid approach: run both, give subagents both outputs, and cross-reference numbers. On drawing-heavy pages where Tesseract produces garbage, subagents are instructed to ignore it.
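One way to make the cross-reference concrete: treat any vision-extracted number that Tesseract never saw as suspect. This is a sketch of the idea, not logic from the shipped prompt:

```python
import re

def numbers_in(text: str) -> set[str]:
    """All digit groups (with optional decimals) appearing in a text dump."""
    return set(re.findall(r"\d+(?:\.\d+)?", text))

def suspect_values(vision_values: list[str], ocr_text: str) -> list[str]:
    """Vision-extracted values containing numbers Tesseract never saw."""
    ocr_numbers = numbers_in(ocr_text)
    # Flag a value if any of its digit groups is absent from the OCR dump.
    return [v for v in vision_values if numbers_in(v) - ocr_numbers]
```

A flagged value like "856 sq ft" with no "856" anywhere in the Tesseract output is exactly the hallucination pattern observed in testing.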
Do NOT use pdftotext or pdfplumber for construction PDFs. pdftotext
produces garbage on drawings and empty output on rasterized pages. pdfplumber
produces reversed text (*TEEHS REVOC*) — completely unusable.
At ~1,500 tokens per page PNG, a full 30-page binder costs ~45K tokens for complete vision extraction — trivial at production pricing.
Create the output directory structure:
{output-dir}/
├── pages-png/ # One PNG per PDF page (resized to 1568px max)
├── pages-text/ # Tesseract OCR text per page (numeric cross-ref)
├── pages-vision/ # Per-page outputs from vision subagents:
│ ├── page-NN.md # Structured markdown (detailed content)
│ └── page-NN.json # Manifest fragment (routing entry)
└── binder-manifest.json # Assembled manifest (routing artifact)
Run scripts/extract-pages.sh to split the PDF into page PNGs, resize them
for API consumption, and run Tesseract OCR:
scripts/extract-pages.sh INPUT.pdf OUTPUT_DIR
The script does three things:

1. Splits the PDF into page PNGs with `pdftoppm` at 200 DPI. Each page becomes `pages-png/page-01.png`, `page-02.png`, etc.
2. Resizes each PNG to 1568px max with `sips` (macOS).
3. Runs `tesseract` on each resized PNG to produce `pages-text/page-01.txt`, etc. These raw text dumps supplement vision extraction by providing reliable numeric values for cross-reference. On drawing-heavy pages, Tesseract output will be garbage; subagents are instructed to recognize and ignore it.

Note: CAD-generated construction PDFs commonly produce Poppler warnings like "Syntax Error: insufficient arguments for Marked Content". These are harmless; the PNGs render correctly. The script suppresses them via `2>/dev/null`.
If pdftoppm is not available, fall back to ImageMagick:

magick -density 200 input.pdf -quality 90 output-dir/pages-png/page-%02d.png

Note that ImageMagick numbers output pages from 0 by default; add `-scene 1` if the rest of the pipeline expects `page-01.png` first. Then manually resize and run Tesseract on the resulting PNGs.
Vision extraction is the most time-intensive step. Use a rolling window
of parallel subagents — one page per subagent, max 3 in flight at any time.
The full prompt template is in prompts/vision-extract-page.md — read it
and use it as the prompt for each subagent.
Each subagent conversation accumulates every image it reads into the message history. With multi-page batches, by the time the subagent processes page 3, all 3 PNGs are in context for every API call. This causes:

- Wasted tokens: every call after the first re-sends earlier pages' PNGs.
- API image limits: images are capped at 2000px once more than 20 images are in the conversation. Construction PNGs at 200 DPI are typically 7200x4800, well over this limit.
- Degraded extraction quality on later pages.
One page per subagent means exactly one image in context. No multi-image limits, cleaner extraction, and the Tesseract text file provides numeric cross-reference without adding image tokens.
Maximum 3 concurrent subagents. One page per subagent.
This is a hard constraint for deployment to Vercel sandboxes (4 GB RAM total). The orchestrator + 3 subagents = 4 processes, each getting ~1 GB RAM. Do not exceed 3 concurrent subagents under any circumstances.
Instead of fixed rounds (launch 3, wait for all 3, launch next 3), use a rolling window: launch 3 subagents, and as each one completes, immediately launch the next. This keeps 3 subagents in flight at all times until all pages are processed.
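A minimal sketch of the rolling window, where `launch_subagent` is a hypothetical stand-in for a blocking Task tool call for one page:

```python
from concurrent.futures import ThreadPoolExecutor

MAX_CONCURRENT = 3  # hard cap: orchestrator + 3 subagents in a 4 GB sandbox

def extract_all(pages: list[int], launch_subagent) -> list[str]:
    """Rolling window: each worker grabs the next page the moment it frees up."""
    with ThreadPoolExecutor(max_workers=MAX_CONCURRENT) as pool:
        return list(pool.map(launch_subagent, pages))
```

A fixed-size worker pool gives the rolling behavior for free: there is no wait-for-the-whole-round barrier between pages.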
Task tool parameters (per subagent):
name: "vision-page-NN"
subagent_type: "general-purpose"
mode: "bypassPermissions"
run_in_background: true
prompt: (read from prompts/vision-extract-page.md,
replace {{PAGE_PNG}}, {{TEXT_FILE}}, {{OUTPUT_MD}},
{{OUTPUT_JSON}}, and {{SKILL_DIR}} with actual paths)
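Filling the template might look like this hypothetical helper (path layout per the directory structure above):

```python
from pathlib import Path

def build_prompt(skill_dir: str, out: str, page: int) -> str:
    """Substitute the template placeholders for one page's subagent."""
    template = Path(skill_dir, "prompts", "vision-extract-page.md").read_text()
    nn = f"{page:02d}"
    return (template
            .replace("{{PAGE_PNG}}", f"{out}/pages-png/page-{nn}.png")
            .replace("{{TEXT_FILE}}", f"{out}/pages-text/page-{nn}.txt")
            .replace("{{OUTPUT_MD}}", f"{out}/pages-vision/page-{nn}.md")
            .replace("{{OUTPUT_JSON}}", f"{out}/pages-vision/page-{nn}.json")
            .replace("{{SKILL_DIR}}", skill_dir))
```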
Completion check per subagent: the `pages-vision/page-NN.md` AND `page-NN.json` files exist.

| Binder Size | Subagents | Max Concurrent | Approx. Wall Time |
|---|---|---|---|
| 9 pages | 9 | 3 | ~3x single page |
| 15 pages | 15 | 3 | ~5x single page |
| 26 pages | 26 | 3 | ~9x single page |
| 30 pages | 30 | 3 | ~10x single page |
Each subagent takes ~3-4 minutes (read references, read PNG, write .md, write .json). With 3 concurrent, a 26-page binder completes in ~30 minutes.
Each subagent writes two files per page to pages-vision/:
`page-NN.md`: structured markdown with full extracted content.

`page-NN.json`: manifest fragment for routing, containing:

- `key_content` array with specific values (guided by extraction priorities)
- `topics` keyword tags for corrections letter matching
- `drawing_zones` spatial map
- `"NOT SHOWN: [item]"` entries for expected-but-absent content
- `_project` metadata (on the cover sheet fragment)

The extraction priorities reference (`references/adu-extraction-priorities.md`) guides subagents on what to capture with specificity and what to flag as absent for each content type. This produces manifest entries targeted for corrections letter routing without any decision-making about compliance.
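For orientation, a hypothetical fragment might look like the following. Field names beyond those listed above, and all values, are invented for illustration; `references/manifest-schema.md` is the authoritative schema:

```json
{
  "page": 4,
  "sheet_id": "A2",
  "category": "architectural",
  "title": "Floor Plan",
  "title_block_address": "1232 Example St",
  "topics": ["floor plan", "egress", "window schedule"],
  "key_content": [
    "Bedroom 2 egress window: 5.7 sq ft net clear opening",
    "NOT SHOWN: carbon monoxide alarm locations"
  ],
  "drawing_zones": {
    "top-left": "floor plan",
    "bottom-right": "window schedule"
  }
}
```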
See prompts/vision-extract-page.md for the full prompt template including
both output formats.
The manifest is what makes everything else useful. It enables an agent to route to the correct page(s) without loading all pages into context.
Since each vision subagent already wrote a JSON manifest fragment per page (in Step 3), assembly is deterministic — no LLM needed.
Run the assembly script:
python3 scripts/assemble-manifest.py {output}/pages-vision {output}/binder-manifest.json
The script:

- Reads all `page-NN.json` fragments from `pages-vision/`
- Lifts `_project` metadata from the cover sheet fragment
- Assembles `{ "project": {...}, "pages": [...] }`
- Writes `binder-manifest.json`

Exit codes: 0 = clean, 1 = assembled with issues, 2 = fatal error.
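The core of the assembly is a few lines of deterministic JSON plumbing. A minimal sketch of the logic, not the shipped script:

```python
import json
import sys
from pathlib import Path

def assemble(fragments_dir: str, out_path: str) -> int:
    """Concatenate page fragments and lift _project into the top level."""
    pages, project, issues = [], None, 0
    for frag_path in sorted(Path(fragments_dir).glob("page-*.json")):
        frag = json.loads(frag_path.read_text())
        if "_project" in frag:          # cover sheet carries project metadata
            project = frag.pop("_project")
        pages.append(frag)
    if project is None:
        issues += 1                     # assembled, but with issues
    Path(out_path).write_text(
        json.dumps({"project": project, "pages": pages}, indent=2))
    return 1 if issues else 0           # 0 = clean, 1 = issues (2 = fatal)

if __name__ == "__main__":
    sys.exit(assemble(sys.argv[1], sys.argv[2]))
```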
The orchestrator MUST always review the assembled manifest (see Step 4a).
The assembly script is deterministic but not smart — it can concatenate JSON
but it cannot catch semantic issues like a wrong category, a vague
key_content entry, or a missing _project field that should have been
extracted. A quick orchestrator read-through catches things the script never
could.
The assembled manifest follows the schema in references/manifest-schema.md.
Each page entry captures:

- `sheet_id`, category, and title from the title block
- `key_content` values and `topics` tags for routing
- `drawing_zones` spatial map
- `title_block_address` for cross-page consistency checks
- `"NOT SHOWN: [item]"` entries for expected-but-absent content (guided by extraction priorities)

After the assembly script runs, the orchestrator must read `binder-manifest.json` and review it. This takes seconds and catches things the script cannot.
Review checklist:

- Read `binder-manifest.json`
- `project` metadata is populated (address, type, owner, sqft)
- `sheet_id` values look reasonable
- `key_content` arrays have specific values, not vague entries

Vision models can hallucinate individual digits: a "3" read as "2", a "5" as "6". When this happens on the cover sheet, the wrong value cascades into project metadata and poisons everything downstream.
Every page's JSON fragment includes a title_block_address field — the
address as read independently from that page's title block. The orchestrator
must use these to verify project-level values:
- Collect the `title_block_address` values from every page entry.
- Take the majority vote: if `project.address` differs from the majority, fix it.

This check exists because in testing, the vision model misread "1232" as "1222" on one page, and that single error propagated through the entire manifest. With 15 pages each independently reading the title block, a single-page hallucination is trivially detectable.
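The vote is a few lines; a sketch, assuming the manifest shape described above:

```python
from collections import Counter

def fix_project_address(manifest: dict) -> None:
    """Majority-vote per-page title block addresses against project.address."""
    votes = Counter(p["title_block_address"] for p in manifest["pages"]
                    if p.get("title_block_address"))
    if not votes:
        return  # no per-page readings to vote with
    majority, _ = votes.most_common(1)[0]
    if manifest["project"].get("address") != majority:
        # A single-page misread loses the vote; overwrite with the majority.
        manifest["project"]["address"] = majority
```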
This review is cheap (one file read + a few string comparisons) and prevents the scenario where subagents did great work but a single hallucination or script assembly glitch ruins the output.
Construction plan title blocks follow consistent conventions:
- `CS` = Cover Sheet
- `A` prefix = Architectural (site plans, floor plans, elevations)
- `S` prefix = Structural (foundation, framing, details)
- `SN` prefix = Structural Notes
- `T` prefix = Title 24 / Energy
- `AIA` prefix = CalGreen/code checklists
- `M` prefix = Mechanical
- `P` prefix = Plumbing
- `E` prefix = Electrical

To enable precise references like "Sheet S2, detail 8, mid-left quadrant", each page's `drawing_zones` map records where content sits on the sheet by quadrant.
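A sketch of prefix-based categorization; the category labels here are assumed, not the schema's authoritative vocabulary. The design point is ordering: longer prefixes must be tested first so `SN` does not match `S` and `AIA` does not match `A`:

```python
def categorize(sheet_id: str) -> str:
    """Map a title block sheet ID to a coarse category by prefix."""
    # Order matters: longest prefixes first.
    prefixes = [("AIA", "code"), ("SN", "structural"), ("CS", "general"),
                ("A", "architectural"), ("S", "structural"), ("T", "energy"),
                ("M", "mechanical"), ("P", "plumbing"), ("E", "electrical")]
    sid = sheet_id.strip().upper()
    for prefix, category in prefixes:
        if sid.startswith(prefix):
            return category
    return "unknown"
```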
After extraction, verify:

- `sheet_id` in the manifest matches what's visible in the PNG title block

When interpreting a corrections letter against extracted plans:

- Match each correction item against the manifest's `topics` and `key_content` arrays to route to the relevant pages (see the sketch after this list)
- Grep the `pages-vision/` markdown for quick text searches

When generating a permit checklist from extracted plans, use the manifest's page categories and `"NOT SHOWN"` entries to flag expected-but-absent content.
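A sketch of the routing idea; the field names follow the fragment fields above, and the naive keyword match is illustrative only:

```python
def route_correction(manifest: dict, correction_item: str) -> list[str]:
    """Sheet IDs whose topics/key_content mention the correction's terms."""
    terms = {w.lower().strip(".,:;") for w in correction_item.split() if len(w) > 3}
    hits = []
    for page in manifest["pages"]:
        haystack = " ".join(
            page.get("topics", []) + page.get("key_content", [])).lower()
        if any(term in haystack for term in terms):
            hits.append(page.get("sheet_id", "?"))
    return hits
```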
For reference, a typical California ADU plan binder contains:
| Category | Typical Sheets | What to Look For |
|---|---|---|
| General | CS (Cover) | Scope of work, sheet index, lot coverage, general notes |
| Code | AIA.1, AIA.2 | CalGreen checklists, compliance checkboxes |
| Architectural | A1-A4 | Site plan, floor plan, elevations, sections, schedules |
| Structural | SN1-SN2, S1-S3 | Notes, foundation, framing, details, shearwall schedules |
| Energy | T-1 through T-3 | CF1R compliance, HVAC specs, mandatory requirements |
| MEP | M1, P1, E1 | Mechanical, plumbing, electrical (not always separate sheets) |
The full extraction workflow. Hard limit: max 3 concurrent subagents, 1 page per subagent (4 GB RAM deployment environment).
Example for a 15-page binder:
Step 1: mkdir -p {output}/pages-png {output}/pages-text {output}/pages-vision
Step 2: bash scripts/extract-pages.sh INPUT.pdf {output}
        → Split PDF into PNGs (200 DPI) → pages-png/page-01.png through page-15.png
        → Resize PNGs to 1568px max (API internal limit)
        → Run Tesseract OCR → pages-text/page-01.txt through page-15.txt
Step 3: Rolling window of vision subagents (prompts/vision-extract-page.md)
→ Launch page-01, page-02, page-03 in parallel (3 in flight)
→ page-01 completes → launch page-04 (still 3 in flight)
→ page-03 completes → launch page-05
→ ... continue until all 15 pages queued ...
→ Wait for final subagents to complete
→ Verify: pages-vision/page-NN.md AND page-NN.json exist for all 15
Step 4: python3 scripts/assemble-manifest.py {output}/pages-vision {output}/binder-manifest.json
→ Reads all page-NN.json fragments
→ Assembles binder-manifest.json (deterministic, no LLM)
Step 4a: Orchestrator reads binder-manifest.json (ALWAYS, not optional)
→ Cross-page consistency check: majority-vote address + repeated values
→ Verifies project metadata, page count, key_content quality
→ Fixes any assembly issues, hallucinations, or missing fields
Step 5: Validate all outputs
Steps 1-2 are sequential (bash). Step 3 uses a rolling window — as each subagent finishes, the next page launches immediately. Each subagent reads one PNG, writes one .md and one .json. Step 4 is a fast Python script (no LLM call). Step 5 is orchestrator validation.
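Step 5 can be as simple as an existence check over the expected artifacts. A sketch, assuming the directory layout above:

```python
from pathlib import Path

def missing_outputs(out: str, n_pages: int) -> list[str]:
    """List every expected per-page artifact that does not exist on disk."""
    expected = []
    for i in range(1, n_pages + 1):
        nn = f"{i:02d}"
        expected += [f"pages-png/page-{nn}.png",
                     f"pages-text/page-{nn}.txt",
                     f"pages-vision/page-{nn}.md",
                     f"pages-vision/page-{nn}.json"]
    expected.append("binder-manifest.json")
    return [rel for rel in expected if not Path(out, rel).exists()]
```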
Why one page per subagent? Each subagent's conversation accumulates every image it reads. With 3 pages per subagent, the 3rd page's API calls include all 3 PNGs in context — wasting tokens, risking API image limits (2000px cap for >20 images in conversation), and degrading quality. One-per-subagent keeps exactly one image in context at all times.
Why inline fragments? Each vision subagent already has the page image in context and has done the deep analysis. Writing a manifest entry at that point is nearly free — just reformatting what it already knows into JSON. This is faster and more accurate than a separate manifest subagent re-reading all the markdown files.
- `scripts/extract-pages.sh`: Split PDF into per-page PNGs (200 DPI), resize to 1568px, run Tesseract OCR for hybrid text cross-reference
- `scripts/assemble-manifest.py`: Assemble page JSON fragments into `binder-manifest.json`
- `prompts/vision-extract-page.md`: Subagent prompt template for single-page vision extraction (produces both markdown and JSON manifest fragment for one page)
- `prompts/vision-extract-batch.md`: Legacy batch prompt (retained for reference; the single-page approach in `vision-extract-page.md` supersedes it)
- `prompts/build-manifest.md`: Legacy manifest subagent prompt (retained for reference; the inline fragment approach supersedes it)
- `references/manifest-schema.md`: JSON schema and field descriptions for `binder-manifest.json`
- `references/adu-extraction-priorities.md`: Domain-aware extraction guide: what to capture, what to flag as absent, and corrections letter terminology by content type
- `references/extraction-findings.md`: Lessons learned from testing on real construction PDFs