Transcribe Audio

Transcribe an audio file locally with speaker diarization. All processing happens on your machine — no data leaves your device.

How to use

Run the CLI transcription tool via Bash. The plugin directory is:

PLUGIN_DIR=$(dirname "$(dirname "$(which transcribe_cli.py 2>/dev/null || echo "")")")

Use the following command pattern:

uv run --directory {PLUGIN_DIR} python transcribe_cli.py "{audio_file}" [options]

Where {PLUGIN_DIR} is the absolute path to the transcription plugin directory (the directory containing transcribe_cli.py). To find it, look for the transcription plugin in the installed plugins — it will be under plugins/transcription/ in the monkey-tools plugin directory.

Arguments

Transcribe Audio

Transcribe an audio file locally with speaker diarization. All processing happens on your machine — no data leaves your device.

How to use

Run the CLI transcription tool via Bash. The plugin directory is:

PLUGIN_DIR=$(dirname "$(dirname "$(which transcribe_cli.py 2>/dev/null || echo "")")")

Use the following command pattern:

uv run --directory {PLUGIN_DIR} python transcribe_cli.py "{audio_file}" [options]

Argument	Description
`audio_file`	(required) Path to the audio file (.m4a, .mp3, .wav, .flac, .ogg, .aac, .mp4)
`--language LANG`	Language code (e.g., `en`, `es`). Auto-detected if omitted.
`--skip-diarization`	Skip speaker identification for faster processing.
`--num-speakers N`	Exact number of speakers if known.
`--min-speakers N`	Minimum expected number of speakers.
`--max-speakers N`	Maximum expected number of speakers.
`--model REPO`	MLX model override (e.g., `mlx-community/whisper-large-v3-turbo` for speed).
`--export FORMAT`	Output format: `txt` (default), `json`, or `srt`.

Transcribe

Transcribe Audio

How to use

Arguments

Transcribe

Transcribe Audio

How to use

Arguments

Examples

Notes

Platform Notes

Songsee

Video Frames

Gifgrep

Qqbot Media

Camsnap

Openai Whisper Api