Step 3 of the PaperOrchestra pipeline (arXiv:2604.05018). Execute the literature search strategy from outline.json — discover candidate papers via web search, verify them through Semantic Scholar (Levenshtein > 70 fuzzy title match, temporal cutoff, dedup by paperId), build a BibTeX file, and draft Introduction + Related Work using ≥90% of the verified pool. Runs in parallel with the plotting-agent. TRIGGER when the orchestrator delegates Step 3 or when the user asks to "find citations for my paper", "draft the related work", or "build the bibliography".
Faithful implementation of the Hybrid Literature Agent from PaperOrchestra (Song et al., 2026, arXiv:2604.05018, §4 Step 3, App. D.3, App. F.1 p.46).
Cost: ~20–30 LLM calls. This is one of the two longest steps (the other is plotting). Wall-time floor is set by Semantic Scholar's 1 QPS verification limit.
Inputs:
- workspace/outline.json — specifically intro_related_work_plan with the Introduction search directions and the 2-4 Related Work methodology clusters
- workspace/inputs/conference_guidelines.md — used to derive cutoff_date
- workspace/inputs/idea.md, workspace/inputs/experimental_log.md — for framing the Intro and grounding the Related Work positioning

Outputs:
- workspace/citation_pool.json — verified Semantic Scholar metadata for every paper that survived verification
- workspace/refs.bib — BibTeX file generated from the verified pool
- workspace/drafts/intro_relwork.tex — drafted Introduction and Related Work sections, written into the template, with the rest of the template preserved verbatim

PHASE 1 — Parallel Candidate Discovery
For each search direction in introduction_strategy.search_directions and for each limitation_search_query in each related_work cluster:
- Use the host's web search tool to discover up to ~10 candidate papers.
- Run up to 10 discovery queries in parallel (host permitting).
- Collect (title, snippet, url) tuples — no verification yet.
→ PRE-DEDUP before Phase 2 (see Step 1.5 below)
PHASE 2 — Sequential Citation Verification (1 QPS, with cache)
For each candidate (after pre-dedup), sequentially:
0. Check s2_cache.json first (scripts/s2_cache.py --check).
If HIT: use cached response, skip live S2 call. No throttle needed.
If MISS: proceed with live request below.
1. Query Semantic Scholar by title:
GET https://api.semanticscholar.org/graph/v1/paper/search?query=<title>
&fields=title,abstract,year,authors,venue,externalIds&limit=5
(Public endpoint, no key. Throttle to 1 QPS for live requests only.)
2. Store the S2 response in cache: s2_cache.py --store.
3. Pick the top hit. Check Levenshtein title ratio against the original
candidate title. If ratio < 70: discard.
4. Bonus: if year and venue exactly align with hints, add a +5 point
match-quality bonus.
5. Require: abstract is non-empty.
6. Require: paper.year (or month if known) strictly predates cutoff_date.
Months default to day-1: e.g., "October 2024" → 2024-10-01.
7. If all checks pass, add to verified pool.
After all candidates are verified, dedup by Semantic Scholar paperId.
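The per-candidate checks in steps 3, 5, and 6 can be sketched in plain Python. This is an illustrative sketch, not the bundled helper: the pure-Python levenshtein_ratio and the year-only cutoff comparison are assumptions modeled on the thresholds stated above (S2 search results here carry a year but not always a month).

```python
from datetime import date

def levenshtein_ratio(a: str, b: str) -> float:
    """0-100 similarity ratio based on Levenshtein edit distance."""
    a, b = a.lower(), b.lower()
    if not a and not b:
        return 100.0
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # substitution
        prev = cur
    return 100.0 * (1 - prev[-1] / max(len(a), len(b)))

def passes_verification(candidate_title: str, hit: dict, cutoff: date) -> bool:
    """Apply steps 3, 5, and 6 to the top Semantic Scholar hit."""
    if levenshtein_ratio(candidate_title, hit.get("title", "")) < 70:
        return False                     # step 3: fuzzy title match
    if not hit.get("abstract"):
        return False                     # step 5: abstract required
    year = hit.get("year")
    # step 6: publication must strictly predate cutoff_date; with only a
    # year available, default to January 1 of that year (day-1 rule).
    return year is not None and date(year, 1, 1) < cutoff
```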
The host agent does the LLM/web work; the deterministic helpers in scripts/
do the math.
cutoff_date
Parse conference_guidelines.md for the submission deadline. The paper aligns the research cutoff with the venue submission deadline (App. D.1):
| Venue | Cutoff |
|---|---|
| CVPR 2025 | Nov 2024 |
| ICLR 2025 | Oct 2024 |
| Other | One month before the stated submission deadline |
Encode as YYYY-MM-DD. Months default to day-1 (e.g., 2024-10-01).
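The day-1 defaulting can be sketched as below. This is a minimal sketch assuming full English month names; the bundled scripts may parse the guidelines differently.

```python
from datetime import date

MONTHS = {m: i for i, m in enumerate(
    ["january", "february", "march", "april", "may", "june", "july",
     "august", "september", "october", "november", "december"], 1)}

def encode_cutoff(text: str) -> str:
    """Turn a phrase like 'October 2024' into the YYYY-MM-DD form, day defaulting to 1."""
    month_name, year = text.strip().split()
    return date(int(year), MONTHS[month_name.lower()], 1).isoformat()
```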
From outline.json:
- introduction_strategy.search_directions (3-5 queries)
- related_work_strategy.subsections:
  - sota_investigation_mission becomes a search query
  - limitation_search_queries (1-3 each)

For each query, use your host's web search tool (e.g., WebSearch in
Claude Code, @web in Cursor, the search tool in Antigravity). Collect the
top ~10 candidates per query: title, abstract snippet, source URL.
If your host supports parallel sub-tasks, fire up to 10 concurrent search queries. If not, run sequentially — slower but functionally equivalent.
If your host has no native web search, OR you want a research-paper-focused
backend with better signal-to-noise, you can use Exa via
the bundled scripts/exa_search.py helper. It is opt-in and reads
EXA_API_KEY from the environment — the repo never commits a key.
export EXA_API_KEY="your-key-here" # get one at https://dashboard.exa.ai/
python skills/literature-review-agent/scripts/exa_search.py \
--query "Sparse attention long context transformers" \
--num-results 15 \
--discovered-for "related_work[2.1]"
Output is a normalized candidate list ready to merge into
raw_candidates.json. Phase 2 verification (Semantic Scholar fuzzy match,
cutoff, dedup) is unchanged. See references/exa-search-cookbook.md for
the full recipe, query patterns, cost estimates, and security notes.
Combine all discovered candidates into a single working list. Tag each with the originating query ID so you can later attribute it to "intro" vs "related_work[i]".
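Tagging can be as simple as attaching the query ID when each (title, snippet, url) tuple is collected. A sketch, where the discovered_for field name is an assumption chosen to match the exa_search.py flag:

```python
def tag_candidates(results, query_id):
    """Attach the originating query ID so intro vs related_work[i] attribution survives the merge."""
    return [
        {"title": title, "snippet": snippet, "url": url, "discovered_for": query_id}
        for (title, snippet, url) in results
    ]

# Merge discovery results from several queries into one working list.
merged = (tag_candidates([("Paper A", "snippet a", "https://example.org/a")], "intro[0]")
          + tag_candidates([("Paper B", "snippet b", "https://example.org/b")], "related_work[2.1]"))
```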
Always run this before starting Phase 2. Multiple search queries routinely return the same papers (e.g., "Attention is All You Need" appears in almost every NLP discovery query). Verifying duplicates wastes 30-40% of S2 quota at 1 QPS.
python skills/literature-review-agent/scripts/pre_dedup_candidates.py \
--in workspace/raw_candidates.json \
--out workspace/deduped_candidates.json
# Prints: "150 candidates → 97 unique (53 duplicates removed)"
Use workspace/deduped_candidates.json as input to Phase 2.
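A sketch of the kind of title normalization pre_dedup_candidates.py presumably performs; the exact rules are an assumption, and only the CLI invocation above is authoritative.

```python
import re

def norm_title(title: str) -> str:
    """Case-fold and collapse punctuation/whitespace so near-identical titles collide."""
    return re.sub(r"[^a-z0-9]+", " ", title.lower()).strip()

def pre_dedup(candidates):
    """Keep the first occurrence of each normalized title, drop later duplicates."""
    seen, unique = set(), []
    for c in candidates:
        key = norm_title(c["title"])
        if key not in seen:
            seen.add(key)
            unique.append(c)
    return unique
```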
For each candidate in deduped_candidates.json, in sequential order:
Step A — check cache first (no S2 call, no throttle needed):
python skills/literature-review-agent/scripts/s2_cache.py \
--cache workspace/cache/s2_cache.json \
--check "<candidate title>"
# exit 0 + prints JSON → use cached response, skip Step B
# exit 1 → proceed to Step B
Step B — live S2 request (cache MISS only, throttle to 1 QPS):
Preferred: use the bundled scripts/s2_search.py helper — it handles
auth, retries, and 429 back-off automatically:
python skills/literature-review-agent/scripts/s2_search.py \
--query "<URL-decoded candidate title>" --limit 5
# If SEMANTIC_SCHOLAR_API_KEY is set the key is forwarded automatically.
# If not, the public unauthenticated endpoint is used (≤1 QPS, still works).
Check whether the key is configured before starting Phase 2:
python skills/literature-review-agent/scripts/s2_search.py --check-key
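Putting Steps A and B together, the cache-then-throttle flow looks roughly like this. A sketch only: the subprocess commands mirror the invocations shown above, the 1-second sleep enforces the 1 QPS limit for live requests only, and the injectable run/sleep parameters are an assumption added to keep the sketch testable.

```python
import json
import subprocess
import time

SCRIPTS = "skills/literature-review-agent/scripts"

def lookup(title, run=subprocess.run, sleep=time.sleep):
    """Cache-first S2 lookup: Step A (no throttle) then, on MISS, Step B (1 QPS)."""
    hit = run(["python", f"{SCRIPTS}/s2_cache.py",
               "--cache", "workspace/cache/s2_cache.json", "--check", title],
              capture_output=True, text=True)
    if hit.returncode == 0:      # cache HIT: cached JSON on stdout, skip Step B
        return json.loads(hit.stdout)
    sleep(1.0)                   # cache MISS: throttle live requests to 1 QPS
    live = run(["python", f"{SCRIPTS}/s2_search.py",
                "--query", title, "--limit", "5"],
               capture_output=True, text=True)
    return json.loads(live.stdout)
```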
Fallback: if you prefer your host's URL fetch tool, GET: