This skill adds automatic voice message transcription to NanoClaw's WhatsApp channel using OpenAI's Whisper API. When a voice note arrives, it is downloaded, transcribed, and delivered to the agent as [Voice: <transcript>].

Phase 1: Pre-flight

Check if already applied

Check if src/transcription.ts exists. If it does, skip to Phase 3 (Configure). The code changes are already in place.

Ask the user

Use AskUserQuestion to collect information:

AskUserQuestion: Do you have an OpenAI API key for Whisper transcription?

If yes, collect it now. If no, direct them to create one at https://platform.openai.com/api-keys.

Phase 2: Apply Code Changes

Prerequisite: WhatsApp must be installed first ( merged). This skill modifies WhatsApp channel files.

Phase 1: Pre-flight

Check if already applied

Check if src/transcription.ts exists. If it does, skip to Phase 3 (Configure). The code changes are already in place.

Ask the user

Use AskUserQuestion to collect information:

AskUserQuestion: Do you have an OpenAI API key for Whisper transcription?

If yes, collect it now. If no, direct them to create one at https://platform.openai.com/api-keys.

Phase 2: Apply Code Changes

Prerequisite: WhatsApp must be installed first ( merged). This skill modifies WhatsApp channel files.

Add Voice Transcription

Phase 1: Pre-flight

Check if already applied

Ask the user

Phase 2: Apply Code Changes

Add Voice Transcription

Phase 1: Pre-flight

Check if already applied

Ask the user

Phase 2: Apply Code Changes

Ensure WhatsApp fork remote

Merge the skill branch

Validate code changes

Phase 3: Configure

Get OpenAI API key (if needed)

Add to environment

Build and restart

Phase 4: Verify

Test with a voice note

Check logs if needed

Troubleshooting

Voice notes show "[Voice Message - transcription unavailable]"

Voice notes show "[Voice Message - transcription failed]"

Agent doesn't respond to voice notes

Feishu Perm

Discord

Coding Agent (bash-first)

Apple Notes

Feishu Wiki

Bear Notes