Step 4 of the PaperOrchestra pipeline (arXiv:2604.05018). ONE single multimodal LLM call that drafts the remaining paper sections (Abstract, Methodology, Experiments, Conclusion), extracts numeric values from experimental_log.md into LaTeX booktabs tables, splices the generated figures from Step 2, and merges everything into the template that already contains Intro + Related Work from Step 3. TRIGGER when the orchestrator delegates Step 4 or when the user asks to "write the methodology and experiments sections" or "fill in the rest of the paper".
Faithful implementation of the Section Writing Agent from PaperOrchestra (Song et al., 2026, arXiv:2604.05018, §4 Step 4, App. F.1 pp. 47–49).
Cost: ONE LLM call (App. B: "Section Writing Agent (1 call): A single, comprehensive multimodal call to draft and compile the complete LaTeX manuscript"). Do NOT split this into per-section calls — the paper explicitly designs it as one comprehensive call so the model can maintain global coherence across sections.
workspace/outline.json — the master planworkspace/inputs/idea.md — technical detailsworkspace/inputs/experimental_log.md — raw data for tables and qualitative analysisworkspace/drafts/intro_relwork.tex — the template with Intro + Related
Work already filled in by Step 3. This is your starting point. The
preamble, package list, style, and the two pre-filled sections must be
preserved verbatim.workspace/citation_pool.json — the citation map ({key, title, abstract}
for each verified paper)workspace/refs.bib — the BibTeX fileworkspace/inputs/conference_guidelines.md — formatting rulesworkspace/figures/ — the actual PNG files from Step 2 (used as
multimodal vision input!)workspace/figures/captions.json — caption text per figure_idworkspace/tex_profile.json — TeX package availability flags (written by
check_tex_packages.py at Step 0). Read this before generating any
LaTeX. It tells you which packages are installed so you select the right
cross-reference pattern, font packages, etc. before you write — not after
you try to compile.workspace/drafts/paper.tex — the complete LaTeX paper, with all sections
filled. The Step 5 Refinement Agent will iterate on this file.Before composing the prompt, read workspace/tex_profile.json and apply
these rules to every LaTeX choice in the generated paper:
| Profile flag | True → use | False → use instead |
|---|---|---|
use_cleveref | \cref{fig:X}, \cref{tab:Y} | Figure~\ref{fig:X}, Table~\ref{tab:Y} |
use_nicefrac | \nicefrac{a}{b} | $a/b$ |
use_microtype | \usepackage{microtype} | omit the line |
use_t1_fontenc | \usepackage[T1]{fontenc} | omit the line |
If tex_profile.json does not exist (old workspace), default to the safe
fallback column (no cleveref, no nicefrac, no microtype, no T1 fontenc).
Run the deterministic helper:
python skills/section-writing-agent/scripts/extract_metrics.py \
--log workspace/inputs/experimental_log.md \
--out workspace/metrics.json
This parses the ## 2. Raw Numeric Data section's markdown tables into
structured JSON. The Section Writing Agent uses this to construct LaTeX
booktabs tables without re-deriving values from raw text. Read
references/latex-table-patterns.md for the booktabs conventions.
Load references/prompt.md (verbatim Section Writing Agent prompt from App.
F.1). Prepend the Anti-Leakage Prompt from
../paper-orchestra/references/anti-leakage-prompt.md.
The user message contains:
outline.json — full contentidea.md — full contentexperimental_log.md — full content (tables AND prose)intro_relwork.tex — full content (this becomes template.tex for the prompt)citation_pool.json — full content (becomes citation_map.json)conference_guidelines.md — full contentfigures_list — array of {figure_id, filename, caption} from
captions.json and the file listingIf your host LLM has no vision input, fall back to text-only mode: pass the
captions in captions.json as descriptions and tell the agent it cannot see
the images directly. Quality drops noticeably (the paper notes that visual
grounding measurably improves figure-text alignment), but the pipeline
still completes.
The agent's response is wrapped in \``latex ... ```fences. Extract the LaTeX code and save toworkspace/drafts/paper.tex`.
# Orphan citation gate: every \cite{KEY} must exist in refs.bib
python skills/section-writing-agent/scripts/orphan_cite_gate.py \
workspace/drafts/paper.tex workspace/refs.bib
# Latex sanity: matched braces, matched begin/end, no unescaped specials
python skills/section-writing-agent/scripts/latex_sanity.py \
workspace/drafts/paper.tex
# Anti-leakage post-check: no author names, emails, affiliations
python skills/paper-orchestra/scripts/anti_leakage_check.py \
workspace/drafts/paper.tex
If any gate fails, re-prompt the writing call with the gate's error report appended to the user message and ask the agent to fix the specific issues. Do NOT try to fix the gate violations by hand — the model needs to see its own mistakes.
These are excerpted from references/prompt.md (App. F.1, pp. 47-49). The
host agent MUST honor them on the writing call:
intro_relwork.tex. Preserve Intro + Related Work verbatim.experimental_log.md. Do not
hallucinate numbers — use the exact values in the log.booktabs package format: \toprule, \midrule, \bottomrule.outline.json provides citation_hints per subsection. For each hint,
find the matching key in citation_pool.json (by title or content) and
use that exact key in \cite{...}.refs.bib. Inventing or guessing keys violates the
Lit Review Agent's verified pool.citation_pool.json for the papers you cite.
Use the abstract context to write specific, accurate sentences about
those works — not generic "[A, B] proposed methods for X".outline.json's section_plan
structure exactly. Hierarchy rule: if 4.1 exists, 4.2 must exist.idea.md or experimental_log.md.
Do not hallucinate math. Do not use complex math just for the sake
of it.figures/. Use the exact filenames
including extensions (e.g., .png) in your \includegraphics commands.\begin{figure}) unless they are very wide.captions.json and should generally be used as-is.template.tex.
Do not change the overall LaTeX style.\begin{X} must match a \end{X} (e.g., \begin{figure*} must be
closed with \end{figure*}, not \end{figure}).\usepackage[capitalize]{cleveref} to
\usepackage[capitalize]{cleverref} — there is no cleverref.sty.\clearpage immediately before \bibliographystyle{...}.
Without it, figures deferred by LaTeX's float algorithm will appear inside
or after the References section — a hard-to-spot layout defect that only
shows up in the compiled PDF. \clearpage forces all pending floats to be
output before the bibliography starts. See
references/latex-table-patterns.md for details.Figure~\ref{fig:X} and Table~\ref{tab:Y}
over bare \ref{fig:X}. This is necessary when cleveref is unavailable
and produces readable prose in all cases. Use \cref{...} only when
cleveref.sty is confirmed present.template.tex in \``latex ... ````.references/prompt.md — verbatim Section Writing Agent prompt from App. F.1references/latex-table-patterns.md — booktabs rules + table-from-log examplesreferences/figure-integration.md — \includegraphics, 2-column handling, placementscripts/extract_metrics.py — markdown tables in experimental_log → JSONscripts/latex_sanity.py — unmatched braces, env mismatches, specialsscripts/orphan_cite_gate.py — every \cite{KEY} exists in refs.bib