Produce slide decks (and optionally narrated demo videos) from research papers. The human drives all outline and visual decisions — the agent executes.

Pipeline

[1] Script Draft ──→ [2] Slide Generation ──→ [3] TTS Audio (optional) ──→ [4] Video Assembly (optional)
     Claude Code          nanobanana /edit     edge-tts / Kokoro / ElevenLabs       ffmpeg

Skip stages 3–4 for slide-only output. User can enter at any stage.

Stage 1: Script / Outline

Input: paper + user-provided outline or slide plan Output: video-scripts.md or slide-outline.md — per-slide content with talking points

The agent drafts scripts based on the user's outline. The user owns the structure — agent does not decide slide count, order, or what to emphasize.

Stage 2: Slide Generation

Produce slide decks (and optionally narrated demo videos) from research papers. The human drives all outline and visual decisions — the agent executes.

Pipeline

[1] Script Draft ──→ [2] Slide Generation ──→ [3] TTS Audio (optional) ──→ [4] Video Assembly (optional)
     Claude Code          nanobanana /edit     edge-tts / Kokoro / ElevenLabs       ffmpeg

Skip stages 3–4 for slide-only output. User can enter at any stage.

Stage 1: Script / Outline

Input: paper + user-provided outline or slide plan Output: video-scripts.md or slide-outline.md — per-slide content with talking points

The agent drafts scripts based on the user's outline. The user owns the structure — agent does not decide slide count, order, or what to emphasize.

Engine	Quality	Cost	Latency	Best For
edge-tts (default)	Very good	Free, unlimited	~6s/slide (cloud)	Quick generation, good male voices
Kokoro	Very good	Free, unlimited	~1.5s/slide (local)	Offline use, fast batch, good female voices
ElevenLabs	Premium	10k chars free/mo	~3s/slide (cloud)	Highest quality, voice cloning

Tool	Stage	Install
Gemini CLI + nanobanana	2	`gemini extensions install https://github.com/gemini-cli-extensions/nanobanana`
LibreOffice + poppler	2 (PPTX)	`brew install --cask libreoffice && brew install poppler`
edge-tts	3	`pip install edge-tts`
Kokoro	3 (offline)	`pip install kokoro soundfile`
ElevenLabs	3 (premium)	`pip install elevenlabs` + `ELEVENLABS_API_KEY`
ffmpeg	4	`brew install ffmpeg`

Making Academic Presentations

Pipeline

Stage 1: Script / Outline

Stage 2: Slide Generation

Making Academic Presentations

Pipeline

Stage 1: Script / Outline

Stage 2: Slide Generation

Stage 3: TTS Audio (optional)

Engine Selection

Quick Start (edge-tts)

Stage 4: Video Assembly (optional)

PPTX Conversion (if needed)

NotebookLM — Human Reference Only

Gotchas

Dependencies

Update Skills

Eval Harness

Ecc Tools Cost Audit

Code Tour

Rules Distill

Design System