Skill Profile

Local Transcribe

Name: Local Transcribe
Author: darknoon

Transcribe audio files (m4a, mp3, wav, ogg, flac, aac, webm, mp4) to text using local whisper-cpp (offline, no API). Use when asked to transcribe locally, convert speech to text offline, or process audio recordings without sending data externally. Converts to 16kHz WAV via ffmpeg, then runs whisper-cli locally. Does not do speaker diarization or advanced transcription.

darknoon · 0 stars · April 10, 2026

Occupation
Category: Media

Skill Content

Audio Transcription

Transcribe audio files to text using whisper-cpp (local, offline).

Script location

skills/local-transcribe/scripts/transcribe.sh

Basic usage

skills/local-transcribe/scripts/transcribe.sh "<audio-file>"

The transcript is printed to stdout. Use --output to save to a file instead.
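The skill description says the input is converted to 16 kHz WAV via ffmpeg and then passed to whisper-cli. The following is a minimal sketch of that two-step pipeline, not the contents of transcribe.sh itself. The function names `wav_name_for` and `transcribe_sketch` are hypothetical, and the choice of mono 16-bit PCM and a temporary directory are assumptions.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of the convert-then-transcribe pipeline.
# This is NOT the actual transcribe.sh; names and details are assumptions.

# Derive the temporary WAV file name for an input file (hypothetical helper).
wav_name_for() {
  local base
  base="$(basename "$1")"
  printf '%s.wav\n' "${base%.*}"
}

# Convert the input to 16 kHz mono 16-bit PCM WAV, then transcribe it.
# $1 = audio file, $2 = path to a GGML model file.
transcribe_sketch() {
  local input="$1" model="$2" tmpdir wav status
  tmpdir="$(mktemp -d)" || return 1
  wav="$tmpdir/$(wav_name_for "$input")"
  # -ar 16000: resample to 16 kHz; -ac 1: downmix to mono (assumed).
  ffmpeg -nostdin -loglevel error -y -i "$input" \
    -ar 16000 -ac 1 -c:a pcm_s16le "$wav" \
    && whisper-cli -m "$model" -f "$wav" -nt   # -nt: omit timestamps
  status=$?
  rm -rf "$tmpdir"
  return "$status"
}
```

Called as `transcribe_sketch recording.m4a ~/.cache/whisper-cpp/ggml-small.bin`, this would print the transcript to stdout, provided ffmpeg and whisper-cli are on the PATH.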

Options

--output PATH — save transcript to a file instead of stdout
--model PATH — path to a whisper.cpp GGML model file (auto-detected if omitted)

Examples

# Transcribe and print to stdout
skills/local-transcribe/scripts/transcribe.sh recording.m4a

# Transcribe and save to file
skills/local-transcribe/scripts/transcribe.sh recording.m4a --output transcript.txt

# Use a specific model
skills/local-transcribe/scripts/transcribe.sh recording.m4a --model ~/.cache/whisper-cpp/ggml-medium.bin
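To process several recordings, the documented `--output` option can be combined with a loop. This is a sketch only: the `transcribe_all` helper and the `TRANSCRIBE` variable are hypothetical names, and writing each transcript next to its source file is an assumption about what is wanted.

```shell
#!/usr/bin/env bash
# Path to the script as documented above; override TRANSCRIBE to point elsewhere.
TRANSCRIBE="${TRANSCRIBE:-skills/local-transcribe/scripts/transcribe.sh}"

# Hypothetical helper: transcribe each given file to a sibling .txt file.
transcribe_all() {
  local f
  for f in "$@"; do
    # Replace the final extension with .txt, e.g. talk.m4a -> talk.txt
    "$TRANSCRIBE" "$f" --output "${f%.*}.txt" || echo "failed: $f" >&2
  done
}
```

For example, `transcribe_all recordings/*.m4a` would leave one `.txt` file beside each recording and report any failures on stderr.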

Models

Model	Size	Speed	Quality
`ggml-tiny.bin`	75 MB	Fastest	Lower accuracy
`ggml-base.bin`	142 MB	Fast	Decent
`ggml-small.bin`	466 MB	Moderate	Good (recommended)
`ggml-medium.bin`	1.5 GB	Slower	Better
`ggml-large.bin`	3.1 GB	Slowest	Best
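The `--model` option is described as auto-detected when omitted, but the search rules are not given here. The sketch below shows one way such detection could work. It assumes models live in `~/.cache/whisper-cpp` (the directory used in the example above) and that the recommended `small` model is tried first; both the `find_model` name and the preference order are hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the first model file found in a directory.
# The preference order (small first, as the recommended model) is an assumption,
# not the documented behavior of transcribe.sh.
find_model() {
  local dir="${1:-$HOME/.cache/whisper-cpp}" name
  for name in small medium base large tiny; do
    if [ -f "$dir/ggml-$name.bin" ]; then
      printf '%s\n' "$dir/ggml-$name.bin"
      return 0
    fi
  done
  return 1  # no model found
}
```

A caller could then run `model="$(find_model)" || echo "no model found" >&2` and pass the result to `--model`.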
