文档
Pdf
Use this skill whenever PDF files are involved in any way for the BC-DATASET pipeline. Triggers include: reading or extracting text from PDFs, splitting PDFs, creating PDF reports, processing Brazilian legal PDFs (MP-GO corpus), implementing the PDF→chunks pipeline (ADR-014), or any mention of a .pdf file. DO NOT USE for Word documents (.docx), spreadsheets, or training scripts.