Tool-only paper processing skill with a manual language parameter: supports batch artifact download for many papers or single-paper download, then the model manually reads source/PDF and writes summary.md in the selected language. Use when per-paper comprehension should be model-driven instead of script-generated.
Use this skill for per-paper manual summarization, with optional batch artifact download.
<run_dir>/<arxiv_id>/).English or Chinese) and apply it manually.summary.md must be written in the selected language.--language <LANG> for traceability.Scripts only fetch artifacts. The model performs reading and writing.
summary.md by script-based snippet extraction, regex harvesting, or template autofill.source/pdf) and trace logs.summary.md must come from model-side reading and synthesis of the paper content.Use this first when Stage B has many papers:
python3 scripts/download_papers_batch.py \
--run-dir /path/to/run \
--artifact source_then_pdf \
--max-workers 3 \
--min-interval-sec 5 \
--language English
Key behavior:
--artifact source, --artifact pdf, or --artifact source_then_pdf (default).--max-workers) and safe throttling/retry (--min-interval-sec, retry args).<run_dir>/.runtime/arxiv_download_state.json) to reduce 429 risk.source/source_extract/*.tex or existing source/paper.pdf (unless --force).summary.md, you can skip that paper's summary-writing step.<run_dir>/download_batch_log.json by default.python3 scripts/download_arxiv_source.py \
--paper-dir /path/to/run/2602.00528 \
--language English
This writes:
source/source_bundle.binsource/source_extract/source/download_source_log.jsonIf usable source already exists and --force is not set, the script reuses local artifacts.
python3 scripts/download_arxiv_pdf.py \
--paper-dir /path/to/run/2602.00528 \
--language English
This writes:
source/paper.pdfsource/download_pdf_log.jsonIf PDF already exists and --force is not set, the script reuses local artifacts.
summary.md already exists and follows the required format, skip this paper and mark it complete.metadata.md first.source/source_extract/ already exists with readable .tex files, use it directly.source/paper.pdf already exists, use PDF directly.summary.md in the same paper directory, in the selected language.Do not rely on rule-based auto summarization. Do not rely on auto-extracted snippets as the primary writing basis.
references/summary-example-en.md and references/summary-example-zh.md.<paper_dir>/summary.md in fixed section format.## 10. Brief Conclusion: write a 3-4 sentence mini-conclusion that covers contribution, method, evaluation setup, and results with paper-specific details.## 1. Paper Snapshot, use exact keys: ArXiv ID, Title, Authors, Publish date, Primary category, Reading basis.Reading source, Author list, Published on, or lowercase key names.See references/summary-format.md for exact section requirements.
This skill is a sub-skill of arxiv-summarizer-orchestrator.
Pipeline position:
arxiv-search-collector produces the selected paper directories and metadata.arxiv-paper-processor downloads artifacts and writes one summary.md per paper.arxiv-batch-reporter uses these per-paper summaries to generate the final collection report.Use this skill together with Step 1 and Step 3 for full end-to-end execution.