Music Listener | Skills Pool

技能檔案

Music Listener

Listen to and appreciate music files. Analyze audio for genre, mood, tempo, and lyrics. Use when users share audio/music files, ask about songs, or want music analysis.

Iroli0076 星標2026年3月29日

職業
分類: 媒體

技能內容

Listen to and appreciate music files. Analyze audio for genre, mood, tempo, and lyrics.

When to Use

User shares an audio/music file and asks about it
User asks you to listen to or comment on a song
User asks "what song is this" or "what do you think of this music"
User sends a voice note containing music

Tools Required

Bash (for ffprobe, ffmpeg, whisper)
Read
view_image

How It Works

Step 1: Audio Info (ffprobe)

ffprobe -v quiet -print_format json -show_format -show_streams "<audio_file>"

Key info: duration, bitrate, sample_rate, codec, title/artist/album tags (if present).

相關技能

ffmpeg -i "<audio_file>" -lavfi showspectrumpic=s=800x200:mode=combined:color=intensity -frames:v 1 "/tmp/music_spec_<id>.png" -y

view_image(path="/tmp/music_spec_<id>.png")

# First convert to wav if needed
ffmpeg -i "<audio_file>" -acodec pcm_s16le -ar 16000 -ac 1 "/tmp/music_audio.wav" -y
whisper "/tmp/music_audio.wav" --model turbo --output_format txt --output_dir /tmp/music_whisper

Read(targetPath="/tmp/music_whisper/<file>.txt")

ffmpeg -i "<audio_file>" -ss 60 -t 120 -acodec pcm_s16le -ar 16000 -ac 1 "/tmp/music_segment.wav" -y