Visual inspection and classification of tax documents. Scans every page (never trust filenames), classifies by IRS form type, files to Paperless. Use when tax documents arrive via Gmail or Drive and need to be discovered, classified, and filed, or when checking completeness of tax document collection.
Scan, visually inspect, and classify every tax document across Gmail, Drive, and Paperless. The cardinal rule: never trust filenames — every page must be visually inspected and classified from its actual content.
| Task | Tool | Notes |
|---|---|---|
| Search Gmail for tax attachments | google-workspace-personal-ro.search_gmail_messages | Date-range + keyword queries |
| Search Drive for tax folders | google-workspace-personal-ro.search_drive_files | Folder and filename scans |
| Download Drive file | google-workspace-personal-ro.get_drive_file_content | Retrieve for visual inspection |
| Search Paperless documents | paperless.search_documents | Find untagged or recently added docs |
| Get Paperless document | paperless.get_document | Metadata and content retrieval |
| Download Paperless document | paperless.download_document | Full document for page-level inspection |
| Upload to Paperless | paperless.post_document | Ingest with metadata |
| Update Paperless metadata | paperless.update_document | Correct tags, title, correspondent |
| List Paperless tags | paperless.list_tags | Resolve tag IDs at runtime |
Scan all document sources for tax-season candidates.
Gmail — search for tax-season attachments:
W-2 OR 1099 OR K-1 OR 1098 OR 1095 OR 5498 OR "tax document" OR "tax form" scoped to the relevant date range (typically Jan 1 through Apr 15 of the filing year).Drive — search for tax folders and loose documents:
Tax/TY 2025/) and run keyword queries.Paperless — search for recently added untagged documents:
tax-relevant tag or without any tags at all.Build candidate list with columns: source (Gmail/Drive/Paperless), filename, date received/modified, sender or folder path, preliminary guess from filename.
For every candidate document — no exceptions:
references/tax-doc-types.md for visual identification hints per form type.Classify each inspected document using references/tax-doc-types.md.
For each document, extract:
Flag discrepancies:
File each classified document to Paperless following the conventions in references/tax-doc-types.md (Filing Conventions section) and the document-filing skill taxonomy.
Generate a filing-season completeness checklist.
Expected documents — based on prior year sources:
Received vs expected:
| Source | Form | TY | Status | Paperless ID |
|--------|------|----|--------|--------------|
| Acme Corp | W-2 | 2025 | Received | #1234 |
| Chase Bank | 1099-INT | 2025 | Received | #1235 |
| Fidelity | 1099-DIV | 2025 | MISSING | — |
Flag gaps: "Missing 1099-DIV from Fidelity (received last year on 2025-02-12)."
Flag duplicates: same form type from same source found in multiple locations.
Recommended follow-up: contact payer, check online portal, wait for late mailing.
Every run must return:
| Field | Type | Description |
|---|---|---|
as_of | ISO 8601 timestamp | When the scan was performed |
documents_scanned | integer | Total candidate documents inspected |
classification_results | list | Each entry: {source, filename, classified_type, tax_year, payer, recipient_last4, key_amounts, paperless_doc_id} |
discrepancies | list | Filename-vs-content mismatches, year mismatches, unknown recipients |
completeness_checklist | list | Expected vs received per source and form type |
filing_results | list | Paperless upload/update confirmations with document IDs |
gaps | list | Missing documents with prior-year context |
recommended_actions | list | Follow-up steps for gaps and discrepancies |
1099.pdf might contain a W-2, a cover letter, or a completely unrelated document.references/tax-doc-types.md../document-filing/SKILL.md../paperless-canonical-ingestion/SKILL.md