Generate or refine high-quality transcription subtitles from audio or video with ElevenLabs STT, word-level timestamps, token-range editing, ASR error correction, terminology consistency, optional user-provided glossaries, and SRT/JSON round-tripping. Use when the user needs audio/video-to-subtitle conversion, high-quality transcription subtitles, sensible semantic segmentation, accurate timing, wrong-word correction, proper-noun unification, glossary-driven review, punctuation or casing cleanup, or an agent-editable transcript that must render back to SRT without losing token coverage.
Use this skill as a quality-first subtitle workflow, not a raw ASR dump.
Workflow:
- Generate the editable draft as <stem>.review.json, and preserve raw artifacts.
- Review qa_flags, fix segmentation/timing/text issues inside the editable JSON, and save <stem>.corrected.json.
- Follow the pipeline raw json -> review json -> corrected json -> srt, and keep the raw JSON cache for later iterations.
- Save the reviewed file as <stem>.corrected.json, then render SRT.
- Name the draft <stem>.review.json; this keeps the generated review draft, the reviewed output, and the raw cache aligned.

Editing rules:
- Check subtitles[].qa_flags, review.normalization_diagnostics, glossary.entries, glossary.candidates, and glossary.collected before exporting.
- Treat each qa_flag as a must-fix item, not a suggestion.
- Keep every non-spacing token covered exactly once.
- Edit subtitles[].text for obvious ASR correction, punctuation cleanup, terminology unification, and line-break polish within the same token span.
- Promote confirmed terms into glossary.collected; keep uncertain terms in glossary.candidates.

CLI options:
- Pass --glossary when the user provides a term list or when terminology consistency matters.
- Use --from-raw-json when rerunning the same media with different segmentation or glossary settings.
- Tune --max-chars and --max-duration per language or density when the defaults do not fit the material.

Naming and invariants:
- Avoid generic names like transcript.json; reviewed outputs must end with .corrected.json.
- Never edit tokens[].id, tokens[].start, tokens[].end, tokens[].type, or tokens[].speaker_id.
- Do not hand-edit subtitles[].start, subtitles[].end, word_*, or speaker_ids; they are derived preview fields.
- Never treat glossary.candidates as locked truth before they are promoted into glossary.collected.

Environment:
- Ensure ffmpeg is available.
- Run cd skills/transcribe2sub && pnpm install before doing anything else; the install may need elevated permissions in the skill directory.
- Set ELEVENLABS_API_KEY; the script can fall back to unauthenticated mode when needed and now auto-enables diarization for that path.

References:
- Read references/subtitle-quality.md before regrouping or polishing subtitles.
- Read references/glossary-format.md before creating or loading a user glossary.
- Read references/elevenlabs-stt-api.md only when API field details matter.

Generate editable JSON instead of direct SRT.
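The field names above imply a particular shape for the editable review JSON. A minimal TypeScript sketch of that shape follows; the grouping and exact types are assumptions inferred from the field names in this document, and the script's real output is authoritative.

```typescript
// Sketch of the editable review JSON, assembled from the field names this
// skill references. Comments mark which fields are editable vs. locked.
interface Token {
  id: number;                    // never edit
  start: number;                 // seconds; never edit
  end: number;                   // seconds; never edit
  type: "word" | "spacing";      // never edit
  speaker_id?: string;           // never edit
}

interface Subtitle {
  token_start: number;           // editable: first token of the span
  token_end: number;             // editable: last token of the span
  text: string;                  // editable: corrected display text
  start?: number;                // derived preview field; do not hand-edit
  end?: number;                  // derived preview field; do not hand-edit
  qa_flags?: string[];           // e.g. "zero_duration", "ends_mid_word"
}

interface ReviewFile {
  tokens: Token[];
  subtitles: Subtitle[];
  glossary: {
    entries: string[];           // locked canonical terms from the user
    candidates: string[];        // review-stage staging, not final
    collected: string[];         // confirmed terms promoted during review
  };
}

// A trivially valid instance, just to show the nesting.
const empty: ReviewFile = {
  tokens: [],
  subtitles: [],
  glossary: { entries: [], candidates: [], collected: [] },
};
```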
Ownership:
Subagent 1 owns this step. It transcribes the audio, saves the raw response as <stem>.elevenlabs.json, and outputs <stem>.review.json.
Naming convention:
- Review draft: <stem>.review.json
- Raw ElevenLabs cache: <stem>.elevenlabs.json
- Reviewed output: <stem>.corrected.json
The script saves the raw ElevenLabs response alongside the main output by default as <output_basename>.elevenlabs.json.
CJK baseline:
cd skills/transcribe2sub
pnpm tsx scripts/transcribe2sub.ts <audio> --format json --max-chars 22 --max-duration 8.0 -o episode.review.json
Spaced-language baseline:
cd skills/transcribe2sub
pnpm tsx scripts/transcribe2sub.ts <audio> --format json --max-chars 38 --max-duration 8.0 -o episode.review.json
If the user provides a term list, pass it in at generation time:
cd skills/transcribe2sub
pnpm tsx scripts/transcribe2sub.ts <audio> --format json --glossary glossary.txt -o episode.review.json
Later, rebuild from the saved raw JSON without calling the API again:
cd skills/transcribe2sub
pnpm tsx scripts/transcribe2sub.ts --from-raw-json episode.elevenlabs.json --format json --glossary glossary.txt -o episode.review.json
Review subtitles[] against the quality rubric.
Save the reviewed file as <stem>.corrected.json, for example episode.corrected.json.
Ownership:
Subagent 2 owns this step. It reads <stem>.review.json, performs review and QA, and saves <stem>.corrected.json.
Subagent 2 must not re-run transcription or regenerate the draft unless the user explicitly asks to restart from raw audio or raw JSON.
Edit only subtitles[].token_start, subtitles[].token_end, subtitles[].text, glossary.candidates, and glossary.collected.
During correction, let the review LLM extract candidate terms into glossary.candidates.
Use subtitles[].text to correct obvious ASR misrecognitions within the same timed span.
Use glossary.entries as locked canonical terms from the user.
Use glossary.candidates as review-stage staging data only; do not treat them as final until they are copied into glossary.collected.
Add newly discovered people names, products, brands, organizations, or domain terms to glossary.collected.
Prioritize qa_flags and review.normalization_diagnostics before spending time on fine-grained polish.
For zero_duration, timing_span_mismatch, too_short, too_long, ends_mid_word, and starts_mid_word, adjust token boundaries first; do not try to polish text around a broken span.
Even when qa_flags are sparse, actively inspect for flash cues, short text hanging too long, unnatural joins across full sentences, and cross-speaker merges.
Treat subtitles[].start, subtitles[].end, word_*, and speaker_ids as derived preview fields.
Never edit tokens[].id, tokens[].start, tokens[].end, tokens[].type, or tokens[].speaker_id.
Ensure every non-spacing token belongs to exactly one subtitle.
After all edits, do a second pass over the whole file and confirm no obvious QA issue remains before exporting.
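The coverage invariant above can be checked mechanically. A minimal sketch follows; treating token_start/token_end as inclusive token-id bounds is an assumption about the span encoding.

```typescript
// Verify that every non-spacing token belongs to exactly one subtitle span.
type Tok = { id: number; type: "word" | "spacing" };
type Sub = { token_start: number; token_end: number };

function checkCoverage(tokens: Tok[], subtitles: Sub[]): string[] {
  // Count how many subtitle spans claim each token id.
  const counts = new Map<number, number>();
  for (const sub of subtitles) {
    for (let id = sub.token_start; id <= sub.token_end; id++) {
      counts.set(id, (counts.get(id) ?? 0) + 1);
    }
  }
  const problems: string[] = [];
  for (const tok of tokens) {
    if (tok.type === "spacing") continue; // spacing tokens are exempt
    const n = counts.get(tok.id) ?? 0;
    if (n !== 1) problems.push(`token ${tok.id} covered ${n} times`);
  }
  return problems;
}
```

An empty result means the edit preserved token coverage; anything else points at the exact token that was dropped or double-assigned.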
Render the corrected JSON back to SRT.
Ownership:
cd skills/transcribe2sub
pnpm tsx scripts/transcribe2sub.ts --from-json episode.corrected.json -o final.srt
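The script performs the SRT rendering itself; the sketch below only illustrates the timestamp format the export targets (SRT uses `HH:MM:SS,mmm` with a comma before the milliseconds), in case you need to spot-check the output by eye.

```typescript
// Convert a time in seconds to an SRT timestamp string ("HH:MM:SS,mmm").
function toSrtTime(seconds: number): string {
  const ms = Math.round(seconds * 1000);
  const h = Math.floor(ms / 3_600_000);
  const m = Math.floor((ms % 3_600_000) / 60_000);
  const s = Math.floor((ms % 60_000) / 1000);
  const frac = ms % 1000;
  const pad = (n: number, w: number) => String(n).padStart(w, "0");
  return `${pad(h, 2)}:${pad(m, 2)}:${pad(s, 2)},${pad(frac, 3)}`;
}
```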
Generate a direct draft only when the user prioritizes speed over review quality.
cd skills/transcribe2sub
pnpm tsx scripts/transcribe2sub.ts <audio> -o draft.srt
When rerunning the same audio with different glossary or segmentation settings, prefer --from-raw-json over re-uploading audio.
Final checklist:
- Edit subtitles[].text to fix wrong words, punctuation, casing, line breaks, and obvious ASR formatting issues.
- Promote glossary.candidates into glossary.collected or delete them during review.
- Keep terminology consistent with glossary.entries and glossary.collected.
- Confirm no blocking qa_flags remain unaddressed and that warning-level flags were consciously reviewed.
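A small sweep over the corrected file can surface any subtitles still carrying qa_flags before export. The split between blocking and warning-level flag names below is an assumption for illustration; the document does not define an official severity mapping.

```typescript
// List subtitles that still carry flags treated here as blocking, so the
// reviewer can confirm everything remaining is a consciously accepted warning.
type FlaggedSub = { text: string; qa_flags?: string[] };

const BLOCKING_FLAGS = new Set([
  "zero_duration",
  "timing_span_mismatch",
  "ends_mid_word",
  "starts_mid_word",
]);

function remainingBlockers(subtitles: FlaggedSub[]): string[] {
  return subtitles.flatMap((sub) =>
    (sub.qa_flags ?? [])
      .filter((flag) => BLOCKING_FLAGS.has(flag))
      .map((flag) => `${flag}: ${JSON.stringify(sub.text)}`)
  );
}
```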