Respond to audio messages with audio-only responses. Use when: (1) the user sends an audio message, (2) the protocol requires an audio-only response, (3) text needs to be converted to speech, (4) audio responses need useful labels.
This skill implements the audio-only response protocol: When user inputs audio, respond with audio only. It handles transcription, processing, TTS conversion, and labeled audio delivery.
Audio responses are labeled with:
```
[Topic] [Duration] [Key Point]
```
Examples:
```
[Disk Space] [15s] EBS volume solved capacity crisis
[Transcription] [8s] Whisper now working with 95% accuracy
[TTS Setup] [12s] espeak configured for basic audio responses
```

espeak:

```bash
echo "Response text" | espeak --stdout > response.wav
```
OpenAI TTS:

```python
from openai import OpenAI

client = OpenAI()
response = client.audio.speech.create(
    model="tts-1",
    voice="alloy",
    input="Response text",
)
response.stream_to_file("response.mp3")  # write the generated audio to disk
```

ElevenLabs:

```python
from elevenlabs import generate, play

audio = generate(text="Response text", voice="Rachel")
play(audio)  # play through the default audio device
```
Google TTS (gTTS):

```python
from gtts import gTTS

tts = gTTS(text="Response text", lang="en")
tts.save("response.mp3")
```
- `scripts/audio_responder.py` — main script that runs the full workflow
- `scripts/tts_engine.py` — TTS conversion with engine fallback
- `scripts/label_generator.py` — generates useful labels
User Audio → Transcription → Processing → Text Response → TTS → Labeled Audio Response
```bash
python3 scripts/audio_responder.py \
  --audio /path/to/user_audio.ogg \
  --tts-engine espeak \
  --label-format "[{topic}] [{duration}s] {key_point}"
```
```bash
# Test TTS
echo "Audio response protocol is now active" | espeak --stdout > test.wav

# Test full workflow
python3 scripts/audio_responder.py --test
```
```python
def handle_telegram_audio(audio_path):
    # Helpers below are provided by the scripts above
    transcript = transcribe_audio(audio_path)          # transcribe
    response_text = generate_response(transcript)      # generate response
    audio_response = text_to_speech(response_text)     # convert to audio
    label = generate_label(transcript, response_text)  # build label
    send_audio_response(audio_response, label)         # send with label
```
Script checks and configures best available TTS:
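A minimal sketch of such a fallback check, assuming availability is inferred from installed binaries, importable packages, and API keys (the function names and the exact checks are illustrative, not the script's actual implementation):

```python
import os
import shutil

# Hypothetical fallback order, mirroring the "fallback_engines" config key.
ENGINE_ORDER = ["elevenlabs", "openai", "google", "espeak"]

def engine_available(name):
    """Best-effort availability check for a single TTS engine."""
    if name == "espeak":
        return shutil.which("espeak") is not None      # CLI binary on PATH
    if name == "openai":
        return bool(os.environ.get("OPENAI_API_KEY"))  # needs an API key
    if name == "elevenlabs":
        return bool(os.environ.get("ELEVENLABS_API_KEY"))
    if name == "google":
        try:
            import gtts  # noqa: F401 -- gTTS needs no API key, only the package
            return True
        except ImportError:
            return False
    return False

def pick_engine(order=ENGINE_ORDER):
    """Return the first usable engine, or None if nothing is available."""
    for name in order:
        if engine_available(name):
            return name
    return None
```

The same ordering can then be driven by the `fallback_engines` list from the configuration file.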
Input: "How did you fix the disk space issue?"
Output Label: [Infrastructure] [18s] Added 50GB EBS volume, now 47GB free
Input: "Can you transcribe audio now?"
Output Label: [Capabilities] [10s] Whisper transcription working with 9 files processed
Input: "What's next?"
Output Label: [Planning] [14s] Fix Moltbook cron, set up auto-transcription
Fallback label:

```
[Response] [{duration}s] Audio reply
```

Environment variables:

```bash
export ELEVENLABS_API_KEY="..."
export OPENAI_API_KEY="..."
export TTS_ENGINE="elevenlabs"  # or openai, google, espeak
```
Configuration file (`~/.config/audio-response.json`):

```json
{
  "tts_engine": "espeak",
  "label_format": "[{topic}] [{duration}s] {key_point}",
  "fallback_engines": ["openai", "google", "espeak"],
  "max_duration": 30
}
```
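A sketch of loading this configuration and falling back to defaults when the file is absent (the defaults mirror the sample above; the helper name `load_config` is hypothetical):

```python
import json
import os

# Defaults mirroring the sample configuration file above.
DEFAULTS = {
    "tts_engine": "espeak",
    "label_format": "[{topic}] [{duration}s] {key_point}",
    "fallback_engines": ["openai", "google", "espeak"],
    "max_duration": 30,
}

def load_config(path="~/.config/audio-response.json"):
    """Merge the user's JSON config over the defaults; missing file = defaults."""
    config = dict(DEFAULTS)
    expanded = os.path.expanduser(path)
    if os.path.exists(expanded):
        with open(expanded) as f:
            config.update(json.load(f))
    return config
```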
Send an audio message to test:
The first audio response will explain the TTS setup status.