스킬 파일

Elevenlabs Transcribe

Name: Elevenlabs Transcribe
Author: qdhenry

Transcribes audio/video files using ElevenLabs Scribe v2 API. Use when transcribing audio files, generating transcripts, or converting speech to text.

qdhenry1,178 스타2026. 3. 1.

직업
카테고리: 미디어

스킬 내용

<objective> Transcribe audio or video files using the ElevenLabs Speech-to-Text API (Scribe v2). Accepts a file path and optional parameters, reads the API key from the project's .env file, and returns a formatted transcription with speaker diarization and audio event tagging. </objective>

<quick_start> Via slash command: /elevenlabs-transcribe path/to/audio.mp3 /elevenlabs-transcribe path/to/audio.mp3 --output transcript.txt --num-speakers 3

Requirements:

ELEVENLABS_API_KEY in the project's .env file
uv installed (dependencies auto-install via PEP 723) </quick_start>

<prerequisites> Before transcribing, verify:

uv is available (dependency installation is automatic via inline script metadata — no venv or manual pip install needed)
API key configured in the .env file where Claude is running:
```
ELEVENLABS_API_KEY=your-key-here
```
Audio file exists and is a supported format (mp3, wav, mp4, m4a, ogg, flac, webm, etc.)

관련 스킬

Elevenlabs Transcribe | Skills Pool

grep -q "ELEVENLABS_API_KEY=" .env 2>/dev/null && echo "API key configured" || echo "API key missing"

uv run ~/.claude/skills/elevenlabs-transcribe/scripts/transcribe.py "<audio_file_path>"

uv run ~/.claude/skills/elevenlabs-transcribe/scripts/transcribe.py "<audio_file_path>" --output transcript.txt --language eng --num-speakers 3

uv run ~/.claude/skills/elevenlabs-transcribe/scripts/transcribe.py "<audio_file_path>" --keyterms "technical term" "product name"

uv run ~/.claude/skills/elevenlabs-transcribe/scripts/transcribe.py "<audio_file_path>" --json --output result.json

[Speaker 0]: Hello, how are you doing today?
[Speaker 1]: I'm doing great, thanks for asking! (laughter)

Flag	Description	Default
`<file>`	Path to audio/video file (required)	-
`--output <path>`, `-o`	Save transcription to file	stdout
`--language <code>`	ISO-639 code (eng, spa, fra, deu, jpn, zho)	auto-detect
`--num-speakers <n>`	Max speakers in audio (1-32)	auto-detect
`--keyterms "t1" "t2"`	Terms to bias transcription towards (max 100)	none
`--timestamps <level>`	Granularity: none, word, character	word
`--no-diarize`	Disable speaker identification	diarize enabled
`--no-audio-events`	Disable audio event tagging	events enabled
`--json`	Output full JSON response	formatted text
</script_options>

Error	Resolution
`ELEVENLABS_API_KEY not found`	Add key to `.env` file in current directory
`uv: command not found`	Install uv: `curl -LsSf https://astral.sh/uv/install.sh` pipe to `sh`
`File not found`	Verify the file path and expand any `~`
`422 Validation Error`	Check file format/size, ensure model_id is valid
`401 Unauthorized`	API key is invalid or expired
</error_handling>

Elevenlabs Transcribe

Elevenlabs Transcribe

Songsee

Video Frames

Gifgrep

Qqbot Media

Camsnap

Openai Whisper Api