Guide for cloning voices with ElevenLabs. Sample prep, quality tips, ethical use.
Use /pocket-knife:elevenlabs-voice-cloner to create a custom voice from audio samples.
The resulting voice_id works with /pocket-knife:elevenlabs-tts and /pocket-knife:elevenlabs-voice-changer.
ElevenLabs analyzes uploaded audio to extract voice characteristics — timbre, pitch, cadence, and accent. The clone is stored in your ElevenLabs account and assigned a voice_id that you can reuse across TTS requests.
| Requirement | Minimum | Recommended |
|---|---|---|
| Duration | 1 minute | 3–5 minutes |
| Noise floor | Clean (no music) | Completely silent background |
| Speakers | 1 only | 1 only |
| Format | MP3, WAV | WAV 44.1kHz 16-bit |
| Content | Clear speech | Varied sentences |
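The measurable rows of the table above (duration and WAV format) can be checked before uploading. The following is a minimal sketch using only Python's standard `wave` module; the function name `check_wav_sample`, the `SampleReport` type, and the threshold constants are illustrative and not part of any ElevenLabs tooling. It only reads WAV files, and it cannot judge background noise, speaker count, or content variety, which still need a listen.

```python
import wave
from dataclasses import dataclass, field

# Thresholds taken from the requirements table; the names are hypothetical.
MIN_SECONDS = 60            # minimum: 1 minute
RECOMMENDED_SECONDS = 180   # lower bound of the recommended 3-5 minutes
RECOMMENDED_RATE = 44100    # recommended: 44.1 kHz
RECOMMENDED_BITS = 16       # recommended: 16-bit


@dataclass
class SampleReport:
    duration_s: float
    sample_rate: int
    bit_depth: int
    problems: list = field(default_factory=list)


def check_wav_sample(path):
    """Read a WAV header and compare it with the table's duration and format rows."""
    with wave.open(path, "rb") as wf:
        frames = wf.getnframes()
        rate = wf.getframerate()
        bit_depth = wf.getsampwidth() * 8  # sample width is reported in bytes

    duration = frames / rate
    report = SampleReport(duration, rate, bit_depth)

    if duration < MIN_SECONDS:
        report.problems.append("shorter than the 1 minute minimum")
    elif duration < RECOMMENDED_SECONDS:
        report.problems.append("below the recommended 3 minutes")
    if rate != RECOMMENDED_RATE:
        report.problems.append(f"sample rate is {rate} Hz, not 44100 Hz")
    if bit_depth != RECOMMENDED_BITS:
        report.problems.append(f"bit depth is {bit_depth}, not 16")
    return report
```

An empty `problems` list means the file meets the recommended duration and format; anything else is a hint to re-record or re-export before cloning.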
Step 1 — Record or source audio
Step 2 — Clean the audio (if needed)
Step 3 — Content variety
Include sentences with:
Voice cloning carries serious ethical responsibilities:
You must only clone:
Never clone a voice to:
ElevenLabs Terms of Service require you to confirm you have the rights to clone any voice you upload.
After cloning, you receive a voice_id. Use it with:
/pocket-knife:elevenlabs-tts
TEXT: "Your script here"
VOICE: [your_voice_id]
The voice_id persists in your ElevenLabs account — you do not need to re-clone for each use.
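For readers who want to see what reusing a voice_id looks like outside the slash command, here is a hedged sketch of a request against the public ElevenLabs REST API. It assumes the `/v1/text-to-speech/{voice_id}` endpoint with an `xi-api-key` header and a JSON body; whether `/pocket-knife:elevenlabs-tts` calls the API this way is an assumption, and the helper name `build_tts_request` and the default `model_id` value are illustrative choices. The function only builds the request and does not send anything.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.elevenlabs.io/v1"


def build_tts_request(voice_id, text, api_key, model_id="eleven_multilingual_v2"):
    """Build (but do not send) a text-to-speech request for a stored voice.

    The model_id default is an assumption; use whichever model your account supports.
    """
    # Quote the voice_id so an unexpected character cannot alter the URL path.
    url = f"{API_BASE}/text-to-speech/{urllib.parse.quote(voice_id, safe='')}"
    body = json.dumps({"text": text, "model_id": model_id}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"xi-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would send it; on success the response body is expected to be the generated audio. The same voice_id can be passed on every call, which mirrors the point above that a clone does not need to be recreated per use.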
| Symptom | Likely cause | Fix |
|---|---|---|
| Robotic or flat output | Monotone training audio | Re-record with more expressive reading |
| Wrong accent detected | Short or unclear sample | Add more varied speech, minimum 3 min |
| Noisy output | Background noise in sample | Use audio editor to clean the sample |
| Missing emotion | Single-tone reading | Include varied emotional content in sample |
| Clone sounds generic | Too little audio | Increase to 5+ minutes |
Run /pocket-knife:elevenlabs-voice-cloner with the file path and a name, note the voice_id it returns, then run /pocket-knife:elevenlabs-tts with that voice_id to generate narration.