Skill File

Transcribe

Name: Transcribe
Author: ma08

Transcribe audio/video files using speech-to-text providers. Use when the user has audio or video files to transcribe.

ma08 · 0 stars · 2026/02/13

Occupation: Software developer
Category: Media

Skill Content

Transcribe Skill

Transcribe audio and video files to text using speech-to-text providers (currently Soniox).

When to Use

User has audio or video files to transcribe
User wants to convert meeting recordings, interviews, or media to text
User needs batch transcription of a directory of files
User mentions transcription, speech-to-text, or converting audio/video to text

How to Use

Single file

uv run scripts/transcribe.py --input recording.mp4

Directory of files

uv run scripts/transcribe.py --input /path/to/videos/ --output-dir /path/to/output/
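
How the script expands a directory into individual files is not shown on this page. As a rough sketch only, assuming it selects files by extension, the directory case could look like the following. The `MEDIA_EXTENSIONS` set and the `find_media_files` helper are hypothetical and are not taken from `scripts/transcribe.py`.

```python
from pathlib import Path

# Assumed extension list for illustration only; the formats the script
# actually supports may differ.
MEDIA_EXTENSIONS = {".mp3", ".mp4", ".m4a", ".wav", ".mov"}


def find_media_files(input_path: str) -> list[Path]:
    """Return the files named by input_path (a single file or a directory)."""
    path = Path(input_path)
    if path.is_file():
        # A single file is returned as-is, without an extension check.
        return [path]
    # Non-recursive scan, sorted so batch output order is stable.
    return sorted(
        p for p in path.iterdir()
        if p.is_file() and p.suffix.lower() in MEDIA_EXTENSIONS
    )
```

The scan is deliberately non-recursive in this sketch; whether the real script descends into subdirectories is not stated here.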

With domain context and terms

uv run scripts/transcribe.py --input meeting.m4a \
  --context "Board meeting Q4 review" \
  --terms "EBITDA,YoY,ARR,Zone,Simply South"
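
`--terms` takes a single comma-separated string, so the whole value must be quoted when a term contains a space, as `Simply South` does. A minimal sketch of how such a value could be split, assuming plain comma splitting with whitespace trimming (`parse_terms` is a hypothetical helper, not the script's actual code):

```python
def parse_terms(raw: str) -> list[str]:
    """Split a comma-separated --terms value into trimmed, non-empty terms."""
    return [term.strip() for term in raw.split(",") if term.strip()]


terms = parse_terms("EBITDA,YoY,ARR,Zone,Simply South")
# terms == ["EBITDA", "YoY", "ARR", "Zone", "Simply South"]
```

Dropping empty entries means a trailing comma or an empty default (`""`) yields an empty list rather than a list containing an empty string.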

Arguments Reference

Flag	Short	Description	Default
`--input`	`-i`	Audio/video file or directory (required)	--
`--output-dir`	`-o`	Output directory	Same as input
`--provider`	`-p`	STT provider (`soniox`)	`soniox`
`--context`	`-c`	Free-text context for accuracy	`""`
`--terms`	`-t`	Comma-separated domain terms	`""`
`--language`	`-l`	Language hint (ISO code)	`en`
`--no-cleanup`		Keep remote files after transcription	`false`
`--no-combined`		Skip combined transcript for directories	`false`
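
For readers who want to see the flags above as a parser definition, here is a sketch using Python's standard `argparse`. It mirrors the flag names, short forms, and defaults listed in the table, but it is a reconstruction for illustration: the actual parser in `scripts/transcribe.py` is not shown on this page and may be built differently. `build_parser` is a hypothetical name.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the documented flags (illustrative only)."""
    parser = argparse.ArgumentParser(
        description="Transcribe audio/video files to text."
    )
    parser.add_argument("-i", "--input", required=True,
                        help="Audio/video file or directory")
    # None stands in for the documented default "same as input".
    parser.add_argument("-o", "--output-dir", default=None,
                        help="Output directory")
    parser.add_argument("-p", "--provider", default="soniox",
                        choices=["soniox"], help="STT provider")
    parser.add_argument("-c", "--context", default="",
                        help="Free-text context for accuracy")
    parser.add_argument("-t", "--terms", default="",
                        help="Comma-separated domain terms")
    parser.add_argument("-l", "--language", default="en",
                        help="Language hint (ISO code)")
    parser.add_argument("--no-cleanup", action="store_true",
                        help="Keep remote files after transcription")
    parser.add_argument("--no-combined", action="store_true",
                        help="Skip combined transcript for directories")
    return parser


args = build_parser().parse_args(["-i", "meeting.m4a", "-t", "EBITDA,YoY"])
# args.input == "meeting.m4a", args.provider == "soniox", args.no_cleanup is False
```

Both `--no-*` flags are modeled as `store_true` switches, which matches their documented default of `false`.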

Examples in Context

uv run scripts/transcribe.py -i ~/Downloads/standup-2026-02-05.m4a \
  -o context/daily/2026-02-05/standup/ \
  --context "Daily standup meeting, Zone team" \
  --terms "Zone,ZonEye,Simply South,Vinoz"

uv run scripts/transcribe.py -i /path/to/factory-videos/ \
  -o context/daily/2026-02-05/factory-tour/ \
  --context "Factory tour at candy manufacturing facility" \
  --terms "tempering,enrobing,fondant,ganache"
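
When the same invocation has to be launched from another Python program rather than typed into a shell, the command can be assembled as an argument list. The sketch below uses only the flags documented above; `build_command` is a hypothetical helper and is not part of the skill.

```python
import os


def build_command(input_path, output_dir=None, context="", terms=()):
    """Assemble the transcribe command as an argument list for subprocess.run."""
    # No shell is involved when passing a list, so "~" must be expanded here.
    cmd = ["uv", "run", "scripts/transcribe.py",
           "--input", os.path.expanduser(input_path)]
    if output_dir:
        cmd += ["--output-dir", output_dir]
    if context:
        cmd += ["--context", context]
    if terms:
        # The script expects one comma-separated string, not repeated flags.
        cmd += ["--terms", ",".join(terms)]
    return cmd


cmd = build_command(
    "/path/to/factory-videos/",
    output_dir="context/daily/2026-02-05/factory-tour/",
    context="Factory tour at candy manufacturing facility",
    terms=["tempering", "enrobing", "fondant", "ganache"],
)
```

Passing the result to `subprocess.run(cmd, check=True)` avoids shell quoting problems with multi-word values such as the `--context` string.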
