Transform any text into emotionally expressive audio with ambient soundscapes using ElevenLabs v3 audio tags and the Sound Effects API.
MoodCast analyzes your content, adds expressive delivery using ElevenLabs v3 audio tags, and layers matching ambient soundscapes.
Use MoodCast when the user wants to:
Trigger phrases: "read this dramatically", "make this sound good", "create audio for", "moodcast this", "read with emotion", "narrate this"
Slash command: /moodcast
Automatically analyzes text and inserts appropriate v3 audio tags:
- [excited], [nervous], [angry], [sorrowful], [calm], [happy]
- [whispers], [shouts], [rushed], [slows down]
- [laughs], [sighs], [gasps], [giggles], [crying]
- [pause], [breathes], [stammers], [hesitates]

Creates matching background audio using the Sound Effects API:
For conversations/scripts, assigns different voices to speakers with appropriate emotional delivery.
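
As a rough illustration of per-speaker voice assignment, the sketch below parses a script written as `Speaker: line` and gives each new speaker the next voice from a pool. The script format, the `assign_voices` helper, and the `VOICE_ID_B` / `VOICE_ID_C` placeholders are assumptions made for this example; the document does not describe how MoodCast actually detects speakers or picks voices.

```python
import re

DEFAULT_VOICE = "CwhRBWXzGAHq8TQ4Fs17"  # default voice ID quoted in this document
# Placeholder IDs (hypothetical); replace with real IDs from --list-voices.
VOICE_POOL = [DEFAULT_VOICE, "VOICE_ID_B", "VOICE_ID_C"]

# Assumed script format: "Speaker: spoken text" on each line.
SPEAKER_LINE = re.compile(r"^\s*([A-Za-z][\w ]{0,30}):\s*(.+)$")


def assign_voices(script):
    """Return (speaker, voice_id, text) tuples, one per dialogue line."""
    voices = {}
    segments = []
    for line in script.splitlines():
        match = SPEAKER_LINE.match(line)
        if not match:
            continue  # skip lines that are not in "Speaker: text" form
        speaker = match.group(1).strip()
        text = match.group(2).strip()
        if speaker not in voices:
            # Cycle through the pool when there are more speakers than voices.
            voices[speaker] = VOICE_POOL[len(voices) % len(VOICE_POOL)]
        segments.append((speaker, voices[speaker], text))
    return segments
```

Each returned segment could then be tagged and synthesized separately with its own voice ID.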
python3 {baseDir}/scripts/moodcast.py --text "Your text here"
python3 {baseDir}/scripts/moodcast.py --text "Your text here" --ambient "coffee shop background noise"
python3 {baseDir}/scripts/moodcast.py --text "Your text here" --output story.mp3
python3 {baseDir}/scripts/moodcast.py --text "Your text" --mood dramatic
python3 {baseDir}/scripts/moodcast.py --text "Your text" --mood calm
python3 {baseDir}/scripts/moodcast.py --text "Your text" --mood excited
python3 {baseDir}/scripts/moodcast.py --text "Your text" --mood scary
python3 {baseDir}/scripts/moodcast.py --list-voices
python3 {baseDir}/scripts/moodcast.py --text "Your text" --voice VOICE_ID --model eleven_v3 --output-format mp3_44100_128
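
The `--voice`, `--model`, and `--output-format` flags line up with the parameters of the ElevenLabs text-to-speech REST endpoint. The sketch below shows one way such a request could be assembled with the standard library. It is an assumption about what happens under the hood, not the contents of `moodcast.py` (which may use the official SDK instead), and `build_tts_request` is a hypothetical helper name.

```python
import json
import urllib.request

API_BASE = "https://api.elevenlabs.io/v1"


def build_tts_request(text, api_key,
                      voice_id="CwhRBWXzGAHq8TQ4Fs17",
                      model_id="eleven_v3",
                      output_format="mp3_44100_128"):
    """Build (but do not send) a text-to-speech request."""
    url = f"{API_BASE}/text-to-speech/{voice_id}?output_format={output_format}"
    body = json.dumps({"text": text, "model_id": model_id}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={"xi-api-key": api_key, "Content-Type": "application/json"},
    )
```

Sending the request with `urllib.request.urlopen(req)` and writing the response bytes to a file would roughly correspond to the `--output` option; error handling and retries are left out of this sketch.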
The skill automatically detects and enhances:
| Text Pattern | Audio Tag Added |
|---|---|
| "amazing", "incredible", "wow" | [excited] |
| "scared", "afraid", "terrified" | [nervous] |
| "angry", "furious", "hate" | [angry] |
| "sad", "sorry", "unfortunately" | [sorrowful] |
| "secret", "quiet", "between us" | [whispers] |
| "!" exclamations | [excited] |
| "..." trailing off | [pause] |
| "haha", "lol" | [laughs] |
| Questions | Natural rising intonation |
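
The table above can be approximated with a small rule-based tagger. This is a minimal sketch, not MoodCast's actual implementation: the function names, the sentence splitting, the rule order (keywords before punctuation), and the choice to skip a tag that repeats the previous sentence's tag are all assumptions made for illustration.

```python
import re

# Keyword rules copied from the table above: (keywords, tag).
KEYWORD_RULES = [
    (("amazing", "incredible", "wow"), "[excited]"),
    (("scared", "afraid", "terrified"), "[nervous]"),
    (("angry", "furious", "hate"), "[angry]"),
    (("sad", "sorry", "unfortunately"), "[sorrowful]"),
    (("secret", "quiet", "between us"), "[whispers]"),
    (("haha", "lol"), "[laughs]"),
]


def tag_for_sentence(sentence):
    """Return the first matching tag for a sentence, or None."""
    lowered = sentence.lower()
    for keywords, tag in KEYWORD_RULES:
        if any(re.search(r"\b" + re.escape(k) + r"\b", lowered) for k in keywords):
            return tag
    stripped = sentence.rstrip()
    if stripped.endswith("..."):
        return "[pause]"    # trailing off
    if stripped.endswith("!"):
        return "[excited]"  # exclamation
    return None             # questions and plain sentences get no tag


def enhance(text):
    """Prefix each sentence with its tag, skipping immediate repeats."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    out = []
    previous = None
    for sentence in sentences:
        tag = tag_for_sentence(sentence)
        if tag and tag != previous:
            out.append(f"{tag} {sentence}")
        else:
            out.append(sentence)
        previous = tag
    return " ".join(out)
```

The sketch covers only the patterns listed in the table, so tags that depend on broader context, such as [gasps] or [slows down], are out of its reach.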
Input:
Breaking news! Scientists have discovered something incredible.
This could change everything we know about the universe...
I can't believe it.
Enhanced Output:
[excited] Breaking news! Scientists have discovered something incredible.
[pause] This could change everything we know about the universe...
[gasps] [whispers] I can't believe it.
Input:
It was a dark night. The old house creaked.
Something moved in the shadows...
"Who's there?" she whispered.
Enhanced Output:
[slows down] It was a dark night. [pause] The old house creaked.
[nervous] Something moved in the shadows...
[whispers] "Who's there?" she whispered.
- ELEVENLABS_API_KEY (required) - Your ElevenLabs API key
- MOODCAST_DEFAULT_VOICE (optional) - Default voice ID (defaults to CwhRBWXzGAHq8TQ4Fs17)
- MOODCAST_MODEL (optional) - Default model ID (defaults to eleven_v3)
- MOODCAST_OUTPUT_FORMAT (optional) - Default output format (defaults to mp3_44100_128)
- MOODCAST_AUTO_AMBIENT (optional) - Set to "true" for automatic ambient sounds when using --mood

Configuration Priority: CLI arguments override environment variables, which override hardcoded defaults.
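
The priority order can be sketched as a small lookup function. The environment variable names and default values come from the list above; the `resolve_setting` helper and the dictionary layout are hypothetical and only illustrate the precedence, not the script's actual code.

```python
import os

# Hardcoded defaults, as stated in this document.
DEFAULTS = {
    "voice": "CwhRBWXzGAHq8TQ4Fs17",
    "model": "eleven_v3",
    "output_format": "mp3_44100_128",
}

# Environment variable that backs each setting.
ENV_VARS = {
    "voice": "MOODCAST_DEFAULT_VOICE",
    "model": "MOODCAST_MODEL",
    "output_format": "MOODCAST_OUTPUT_FORMAT",
}


def resolve_setting(name, cli_value=None, env=None):
    """Resolve one setting: CLI argument, then environment, then default."""
    env = os.environ if env is None else env
    if cli_value:
        return cli_value
    env_value = env.get(ENV_VARS[name])
    if env_value:
        return env_value
    return DEFAULTS[name]
```

For example, with `MOODCAST_MODEL` set in the environment, `resolve_setting("model")` returns that value unless `cli_value` is also given, in which case the CLI value is used.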
Use lowercase audio tags: [whispers], not [WHISPERS].

Built by ashutosh887
Using ElevenLabs Text-to-Speech v3 + Sound Effects API
Created for #ClawdEleven Hackathon