name: speak
description: Convert text into speech with Kokoro or Noiz, including simple and timeline-aligned modes.

speak

Convert any text into speech audio. Supports two backends (Kokoro local, Noiz cloud), two modes (simple or timeline-accurate), and per-segment voice control.

Triggers

text to speech / speak / say / tts
voice clone / dubbing
epub to audio / srt to audio / convert to audio

Simple Mode: text to audio

Kokoro (auto-detected when installed)

bash skills/speak/scripts/tts.sh speak -t "Hello world" -v af_sarah -o hello.wav
bash skills/speak/scripts/tts.sh speak -f article.txt -v zf_xiaoni --lang cmn -o out.mp3 --format mp3

Noiz (auto-detected when NOIZ_API_KEY is set, or force with --backend noiz)

If --voice-id is omitted, the script prints 5 available built-in voices and exits.

Pick one from the output and re-run with --voice-id <id>.

bash skills/speak/scripts/tts.sh speak -f input.txt --voice-id voice_abc --auto-emotion --emo '{"Joy":0.5}' -o out.wav
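The auto-detection rules above could be sketched roughly as follows. This is not the logic inside tts.sh: the precedence order (explicit flag, then NOIZ_API_KEY, then Kokoro) and the probe executable name in KOKORO_CMD are assumptions made for illustration, and detect_backend is a hypothetical helper.

```shell
#!/usr/bin/env bash
# Sketch of backend selection. The precedence order and the probe command
# are assumptions, not the documented behavior of tts.sh.

# KOKORO_CMD is a hypothetical name for the executable whose presence
# signals that Kokoro is installed.
KOKORO_CMD="${KOKORO_CMD:-kokoro-tts}"

detect_backend() {
  local forced="${1:-}"
  # 1. An explicit --backend value always wins.
  if [ -n "$forced" ]; then
    printf '%s\n' "$forced"
    return 0
  fi
  # 2. Noiz when an API key is present in the environment.
  if [ -n "${NOIZ_API_KEY:-}" ]; then
    printf 'noiz\n'
    return 0
  fi
  # 3. Kokoro when its executable is found on PATH.
  if command -v "$KOKORO_CMD" >/dev/null 2>&1; then
    printf 'kokoro\n'
    return 0
  fi
  echo "no TTS backend available" >&2
  return 1
}
```

Calling detect_backend with no argument returns the auto-detected backend; passing a name (for example the value given to --backend) short-circuits detection.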

Noiz: optional --duration (float, seconds, range (0, 36]) for target audio length
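Because the range (0, 36] excludes zero but includes 36, it can be worth checking a value before sending a request. A minimal sketch, assuming a hypothetical helper named valid_duration that is not part of tts.sh:

```shell
#!/usr/bin/env bash
# Returns 0 when $1 is a plain decimal number in the range (0, 36],
# and 1 otherwise. Hypothetical helper, not part of tts.sh.
valid_duration() {
  local d="${1:-}"
  # Reject empty input, non-numeric characters, multiple dots, or a lone dot.
  case "$d" in
    ''|*[!0-9.]*|*.*.*|.) return 1 ;;
  esac
  # Shell arithmetic is integer-only, so delegate the float comparison to awk.
  awk -v d="$d" 'BEGIN { exit !((d + 0) > 0 && (d + 0) <= 36) }'
}
```

For example, `valid_duration 12.5 && echo ok` prints ok, while 0 and 36.1 are rejected.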

Voice cloning (Noiz only; no --voice-id needed, uses reference audio)

Use your own reference audio: local file path or URL (only when using Noiz).
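Since the reference audio may be either a local path or a URL, a caller might want to tell the two apart before invoking the script. The sketch below is one way to do that; ref_audio_kind is a hypothetical helper, and treating only http:// and https:// prefixes as URLs is an assumption.

```shell
#!/usr/bin/env bash
# Classifies a reference-audio argument as "url", "file", or "missing".
# Hypothetical helper, not part of tts.sh.
ref_audio_kind() {
  case "${1:-}" in
    http://*|https://*)
      printf 'url\n'
      ;;
    *)
      if [ -f "${1:-}" ]; then
        printf 'file\n'
      else
        # Neither a URL nor an existing regular file.
        printf 'missing\n'
        return 1
      fi
      ;;
  esac
}
```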
