Name: Hotel Voice Interaction
Author: cagegit

Search skills.../

客人：我办一下入住
小瑞：好的，请问您的姓名或预订号？
客人：张伟
小瑞：找到了！张伟先生，预定了豪华大床房一间，今晚入住两晚，总价 776 元。对吗？
客人：对
小瑞：好的，请出示一下身份证，我帮您扫描录入。

客人：[拨通电话音效] 喂，前台吗？
小瑞：您好，这里是酒店前台，我是小瑞。请问有什么可以帮您？

客人：帮我查一下 305 房间的状态
小瑞：正在查询...305 是豪华大床房，当前状态：**已清洁，待入住**。可以安排新客人入住。

{
  "talk": {
    "ttsEngine": "coqui",
    "baseUrl": "http://127.0.0.1:8000",
    "endpoint": "/v1/text-to-speech",
    "voiceId": "default", // Coqui TTS 默认音色，也可用 model_name
    "outputFormat": "mp3_44100_128", // 根据 Coqui 支持调整
    "interruptOnSpeech": true
  },
  "fallback": {
    "ttsProvider": "elevenlabs",
    "apiKey": "sk-xxxxx", // 保留 ElevenLabs key 作为 backup
    "voiceId": "pNInz6obpgDQGcFmaJgB"
  },
  "voiceWake": {
    "enabled": true,
    "keywords": ["小瑞", "你好", "前台"]
  }
}

# 调用 Coqui TTS 并播放
curl -X POST http://127.0.0.1:8000/v1/text-to-speech \
  -H "Content-Type: application/json" \
  -d '{"text":"您好，这里是酒店前台，我是小瑞。请问有什么可以帮您？"}' \
  -o /tmp/tts_output.wav && afplay /tmp/tts_output.wav

# 如果返回的是音频流，直接输出到 stdout 再播放
curl -s -X POST http://127.0.0.1:8000/v1/text-to-speech \
  -H "Content-Type: application/json" \
  -d '{"text":"您的预订已确认"}' | afplay

speak() {
  local text="$1"
  local output="/tmp/tts_$(date +%s).wav"
  
  if curl -s -X POST "http://127.0.0.1:8000/v1/text-to-speech" \
    -H "Content-Type: application/json" \
    -d "{\"text\":\"$text\"}" \
    -o "$output"; then
    afplay "$output"
    rm -f "$output"
  else
    echo "TTS 服务不可用，切换到备用方案..."
    say "$text"
  fi
}

# 使用
speak "欢迎光临酒店，请问有什么可以帮您？"

import requests

def coqui_tts(text, model="tts_models/zh-CN/baker/tacotron2-DDC"):
    resp = requests.post(
        "http://127.0.0.1:8000/v1/text-to-speech",
        json={"text": text, "model_id": model}
    )
    if resp.status_code == 200:
        return resp.content  # 音频二进制
    return None

问题	解决方法
`afplay: file not found`	检查输出路径是否正确，或用 `ffplay` 代替
没有声音	检查系统音量，或 `say "test"` 测试 macOS 原生 TTS
连接拒绝	确认 Coqui TTS 服务正在运行：`curl http://127.0.0.1:8000/health`

# 方式 1: Python 直接启动
pip install TTS
tts --list_models

tts --model_name tts_models/zh-CN/baker/tacotron2-DDC \
    --text "测试语音" \
    --out_path output.wav

# 方式 2: 使用 API server
python -m TTS.server --listen 127.0.0.1:8000

问题	可能原因	解决方案
听不到声音	Coqui TTS 服务未启动	检查 `curl http://127.0.0.1:8000/health`
连接超时	API 端口/路由错误	确认 `/v1/text-to-speech` 路径正确
返回乱码	编码不匹配	检查 response content-type 为 audio/mpeg
音色奇怪	模型不支持中文	改用 `tts_models/zh-CN` 系列模型
服务崩溃	显存不足	降低 batch_size 或用 CPU 模式运行

Hotel Voice Interaction | Skills Pool

Hotel Voice Interaction

Hotel Voice Interaction

🎤 语音交互 (Voice Interaction)

触发条件

唤醒词

语音交互规范

回复风格（语音场景）

文字对话风格

Talk Mode 配置

TTS 语音合成路径（按优先级）

1️⃣ Coqui TTS (本地) ⭐ 已启用

2. ElevenLabs (备选) - 云端高质量

3. macOS `say` (兜底) - 系统内置

STT 语音识别路径

常见场景示例

入住登记

电话接听模拟

房间查询

配置说明

实际调用方法（前端 Agent 使用）

Shell 命令行方式

带错误处理的封装函数

Python 调用示例（适合后台任务）

在 Agent 对话中使用

故障排查

启动 Coqui TTS 服务

本地测试提示

Troubleshooting

Openai Whisper

Voice Call

Prose

Clawhub

Sherpa Onnx Tts

Openai Whisper Api

Hotel Voice Interaction

Hotel Voice Interaction

🎤 语音交互 (Voice Interaction)

触发条件

唤醒词

语音交互规范

回复风格（语音场景）

文字对话风格

Talk Mode 配置

TTS 语音合成路径（按优先级）

1️⃣ Coqui TTS (本地) ⭐ 已启用

2. ElevenLabs (备选) - 云端高质量

3. macOS say (兜底) - 系统内置

STT 语音识别路径

常见场景示例

入住登记

电话接听模拟

房间查询

配置说明

实际调用方法（前端 Agent 使用）

Shell 命令行方式

带错误处理的封装函数

Python 调用示例（适合后台任务）

在 Agent 对话中使用

故障排查

启动 Coqui TTS 服务

本地测试提示

Troubleshooting

Openai Whisper

Voice Call

Prose

Clawhub

Sherpa Onnx Tts

Openai Whisper Api

3. macOS `say` (兜底) - 系统内置