Audio deconstruction and composition via Strudel live-coding. Decompose any audio into stems, extract samples, compose with the vocabulary, render offline to WAV/MP3.
⚠️ Legal Notice: This tool processes audio you provide. You are responsible for ensuring you have the rights to use the source material. The authors make no claims about fair use, copyright, or derivative works regarding your use of this tool with copyrighted material.
Compose, render, deconstruct, and remix music using code. Takes natural language prompts → writes Strudel patterns → renders offline through real Web Audio synthesis → posts audio or streams to Discord VC (via the OpenClaw gateway — no separate credentials needed). Can also reverse-engineer any audio track into stems, samples, and generative programs.
New here? Read docs/ONBOARDING.md for a ground-up introduction.
Rendering MUST run as a sub-agent or background process, never inline in your main session.
The offline renderer (chunked-render.mjs / offline-render-v2.mjs) runs a tight audio-processing loop that blocks the Node.js event loop. If you run it in your main OpenClaw session, it will kill the gateway after ~30 seconds (the heartbeat timeout).
✅ Correct: spawn a sub-agent or use background exec
❌ Wrong: run the renderer inline in your main conversation
Always do this:
# Background exec with timeout
exec background:true timeout:120 command:"node src/runtime/chunked-render.mjs src/compositions/my-track.js output/my-track.wav 20"
Or spawn a sub-agent:
sessions_spawn task:"Render strudel-music composition: node src/runtime/chunked-render.mjs ..."
This is the #1 way to break things. Don't skip this.
# 1. Setup
cd ~/.openclaw/workspace/strudel-music
npm run setup # installs deps + downloads samples (~11MB)
# 2. Verify
npm test # 12-point smoke test
# 3. Render
node src/runtime/chunked-render.mjs assets/compositions/fog-and-starlight.js output/fog.wav 16
ffmpeg -i output/fog.wav -codec:a libmp3lame -b:a 192k output/fog.mp3
| Invocation | What it does |
|---|---|
| /strudel <prompt> | Compose from natural language — mood, scene, genre, instruments |
| /strudel play <name> | Stream a saved composition into Discord VC |
| /strudel list | Show available compositions with metadata |
| /strudel samples | Manage sample packs (list, download, add) |
| /strudel concert <tracks...> | Play a setlist in Discord VC |
- Pick mood parameters (see references/mood-parameters.md)
- Write a .js composition using Strudel pattern syntax
- Render: node src/runtime/chunked-render.mjs <file> <output.wav> <cycles> [chunkSize]
ffmpeg -i output.wav -codec:a libmp3lame -b:a 192k output.mp3
node src/runtime/offline-render-v2.mjs assets/compositions/combat-assault.js /tmp/track.wav 12 140
ffmpeg -i /tmp/track.wav -ar 48000 -ac 2 /tmp/track-48k.wav -y
node scripts/vc-play.mjs /tmp/track-48k.wav
WSL2 users: enable mirrored networking (networkingMode=mirrored in .wslconfig) or VC streaming will fail silently (NAT breaks Discord's UDP voice protocol).
Samples live in samples/. Any directory of WAV files is auto-discovered.
samples/
├── strudel.json ← sample map (pitch info, paths)
├── kick/
│ └── kick.wav
├── hat/
│ └── hat.wav
├── bass_Cs1/
│ └── bass_Cs1.wav ← pitched sample (root: C#1)
├── synth_lead/
│ └── synth_lead.wav ← pitched sample (root: C#3, declared in strudel.json)
└── bloom_kick/
└── bloom_kick.wav ← from audio deconstruction
Maps sample names to files with optional root note declarations. The renderer uses this as the authoritative source for pitch detection.
{
"_base": "./",
"kick": { "0": "kick/kick.wav" },
"bass_Cs1": { "cs1": "bass_Cs1/bass_Cs1.wav" },
"synth_lead": { "cs3": "synth_lead/synth_lead.wav" }
}
Pitched sample folders use a name suffix (_Cs1, _D2) to declare the root pitch; unpitched samples use "0" as the key.

bash scripts/samples-manage.sh list # show installed packs
bash scripts/samples-manage.sh add <url> # download from URL
bash scripts/samples-manage.sh add ~/my-samples/ # add local directory
Ships with dirt-samples (153 WAVs, CC-licensed). Security: downloads enforce size limits (STRUDEL_MAX_DOWNLOAD_MB, default 10GB), MIME validation, optional host allowlist (STRUDEL_ALLOWED_HOSTS).
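The safety controls are plain environment variables. A hedged sketch of tightening them before fetching an untrusted pack — the size cap and host list here are example values, not the defaults:

```shell
# Example values only; tighten limits before fetching an untrusted pack.
export STRUDEL_MAX_DOWNLOAD_MB=500                      # cap pack downloads at 500 MB
export STRUDEL_ALLOWED_HOSTS="github.com,archive.org"   # restrict download sources
# bash scripts/samples-manage.sh add <url>              # would now enforce both limits
echo "download cap: ${STRUDEL_MAX_DOWNLOAD_MB} MB"
```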
CC0 / Free packs (just download and drop in samples/):
Your own packs: Export from any DAW (Ableton, FL Studio, M8 tracker, etc.) as WAV directories. Strudel doesn't care where they came from — it's just WAV files in folders.
Named banks (Strudel built-in, requires CDN access):
sound("bd sd cp hh").bank("RolandTR909")
sound("bd sd hh oh").bank("LinnDrum")
If running on WSL2 and streaming to Discord VC, enable mirrored networking:
# %USERPROFILE%\.wslconfig
[wsl2]
networkingMode=mirrored
Then wsl --shutdown and relaunch. Without this, WSL2's NAT breaks Discord's UDP voice protocol — the bot joins the channel but no audio flows because IP discovery packets can't traverse the NAT return path. Mirrored mode eliminates the NAT by putting WSL2 directly on the host's network stack.
This only affects VC streaming. Offline rendering and file posting work in any networking mode.
Two tiers, depending on what you need:
- Compose + render: Node.js only (offline synthesis via OfflineAudioContext)
- Full deconstruction: everything above, plus the Python packages demucs, librosa, numpy, scipy, scikit-learn, torch

Install the Python deps:
pip install demucs librosa numpy scipy scikit-learn torch
If Python deps are missing, composition and rendering still work — you just can't do stem extraction. The skill should fail gracefully with a message, not a stack trace.
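A minimal sketch of such a graceful check from shell; it probes only two of the deps, on the assumption that the stack is installed together:

```shell
# Probe the optional Python stack; degrade instead of crashing.
if python3 -c "import demucs, librosa" 2>/dev/null; then
  pipeline_status="full"
  echo "full pipeline available (stem extraction enabled)"
else
  pipeline_status="degraded"
  echo "stem extraction unavailable (composition and rendering still work)"
fi
```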
If you have an MP3 and want to extract instruments from it, build sample racks, and compose with the extracted material — that's the full pipeline. It goes:
MP3 → Demucs (stem separation) → librosa (analysis) → sample slicing → Strudel composition → render → MP3
This is a 4–8 minute process for a typical track. See docs/pipeline.md for the complete stage-by-stage breakdown with commands, timings, and resource requirements.
# 1. Separate stems (Python/Demucs)
python -m demucs input.mp3 --out ./stems
# 2. Analyze + slice (see docs/pipeline.md for details)
# Currently semi-manual — analysis scripts in development
# 3. Write composition referencing sliced samples
# 4. Render
bash scripts/dispatch.sh render my-composition.js 16 120
# 5. Convert
ffmpeg -i output.wav -c:a libmp3lame -q:a 2 output.mp3 -y
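The numbered steps above can be chained with fail-fast handling. A hedged sketch with a dry-run guard — the file names are placeholders, and DRY_RUN is an assumption of this sketch, not a flag the scripts support:

```shell
# Chain the documented stages behind a dry-run guard so the flow can be
# inspected before committing 4-8 minutes of compute. DRY_RUN=0 runs for real.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "would run: $*"
  else
    "$@" || { echo "stage failed: $*" >&2; return 1; }
  fi
}
run python -m demucs input.mp3 --out ./stems                    # 1. separate stems
run bash scripts/dispatch.sh render my-composition.js 16 120    # 3-4. compose + render
run ffmpeg -i output.wav -c:a libmp3lame -q:a 2 output.mp3 -y   # 5. convert
```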
| Stage | CPU estimate | GPU estimate |
|---|---|---|
| Demucs stem separation | ~15s/min of audio | ~3s/min of audio |
| Audio analysis (per stem) | ~10–20s | ~10–20s |
| Sample slicing | ~5s | ~5s |
| Composition | instant (human/AI writes JS) | instant |
| Rendering | ~30–60s/min of output | ~30–60s/min of output |
| MP3 conversion | ~5s | ~5s |
Total (4-min track, CPU): 4–8 minutes. Compose + render only (no Demucs): 2–3 minutes.
DO NOT run this inline in a Discord channel interaction or primary OpenClaw session. The 30-second response timeout will kill the process mid-render. There is no supervisor to recover. The skill will appear broken — silence, no output, no error message.
From an OpenClaw agent (correct):
sessions_spawn({
task: "Render strudel composition: /strudel dark ambient tension, 65bpm",
mode: "run",
runTimeoutSeconds: 600 // 10 minutes — generous for full pipeline
})
Background process (also correct):
exec({ command: "bash scripts/dispatch.sh render ...", background: true })
Direct CLI (fine for testing):
bash scripts/dispatch.sh render assets/compositions/fog-and-starlight.js 16 72
What to tell the user: "Rendering takes a few minutes — I'll post the audio when it's ready." Don't leave them hanging with no feedback.
// WRONG — will timeout after 30s in Discord context
exec({ command: "bash scripts/dispatch.sh render ..." })
// WRONG — blocking the main session for minutes
// (anything inline that takes >30s)
Detailed documentation lives in docs/:
| Document | What it covers |
|---|---|
| docs/pipeline.md | Full pipeline stages, commands, timings, resource requirements, system dependencies |
| docs/composition-guide.md | Practical composition lessons — mini-notation pitfalls, the space-vs-angle-bracket rule, .slow() interactions, debugging hap explosions |
| docs/TESTING.md | Testing strategy — smoke tests, cross-platform validation, quality gates, naive install testing |
Start with composition-guide.md if you're writing patterns. The space-separated vs angle-bracket distinction is the #1 source of bugs (gain explosions, distortion, memory crashes). The guide covers it with real case studies.
The offline renderer uses node-web-audio-api (Rust-based Web Audio for Node.js) for real audio synthesis:
- @strudel/core + @strudel/mini + @strudel/tonal parse pattern code into timed "haps"
- OfflineAudioContext.startRendering() produces complete audio

Note on mini notation: The renderer explicitly calls setStringParser(mini.mini) after import because Strudel's npm dist bundles duplicate the Pattern class across modules. Same class of bug as openclaw#22790.
setcpm(120/4) // 120 BPM
stack(
s("bd sd [bd bd] sd").gain(0.4), // drums (samples)
s("[hh hh] [hh oh]").gain(0.2), // hats
note("c3 eb3 g3 c4") // melody
.s("sawtooth")
.lpf(sine.range(400, 2000).slow(8)) // filter sweep
.attack(0.01).decay(0.3).sustain(0.2) // ADSR envelope
.room(0.4).delay(0.2) // space
.gain(0.3)
)
| Syntax | Meaning |
|---|---|
| "a b c d" | Sequence (one step per beat) |
| "[a b]" | Subdivide (two events in one beat) |
| "<a b c>" | Alternate per cycle (slowcat) |
| "a*3" | Repeat three times within the step |
| "~" | Rest / silence |
| .slow(2) / .fast(2) | Time stretch (halve / double the speed) |
| .euclid(3,8) | Euclidean rhythm (3 hits spread over 8 steps) |
| Mood | Tempo | Key/Scale | Character |
|---|---|---|---|
| tension | 60-80 | minor/phrygian | Low cutoff, sparse, drones |
| combat | 120-160 | minor | Heavy drums, fast, distorted |
| peace | 60-80 | pentatonic/major | Warm, slow, ambient |
| mystery | 70-90 | whole tone | Reverb, sparse |
| victory | 110-130 | major | Bright, fanfare |
| ritual | 45-60 | dorian | Organ drones, chant |
Full tree: references/mood-parameters.md. Production techniques: references/production-techniques.md.
Use <> (slowcat) for sequential values, NOT spaces:
// ❌ WRONG — all values play simultaneously, causes clipping
s("kick").gain("0.3 0.3 0.5 0.3")
// ✅ RIGHT — one value per cycle
s("kick").gain("<0.3 0.3 0.5 0.3>")
Full list: docs/KNOWN-PITFALLS.md
Always check after rendering:
ffmpeg -i output.wav -af loudnorm=print_format=json -f null - 2>&1 | grep -E "input_i|input_tp"
Target: -16 to -10 LUFS, true peak below -1 dBTP. Above -5 LUFS = something is wrong.
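As a sketch, the check can be scripted; the JSON literal below is a hypothetical stand-in for the loudnorm stats you would capture from the ffmpeg command above:

```shell
# Hypothetical loudnorm stats; in practice, capture the ffmpeg output instead.
stats='{ "input_i" : "-14.2", "input_tp" : "-1.8" }'
# Pull the integrated loudness (input_i) out of the JSON.
lufs=$(printf '%s' "$stats" | grep -o '"input_i"[^,]*' | grep -oE -- '-?[0-9]+(\.[0-9]+)?')
# Flag anything hotter than -5 LUFS, per the guideline above.
if awk -v l="$lufs" 'BEGIN { exit !(l > -5) }'; then
  verdict="too hot: check your gain staging"
else
  verdict="OK"
fi
echo "$lufs LUFS -> $verdict"
```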
Full pipeline docs: references/integration-pipeline.md
Audio → Demucs (stems) → librosa (analysis) → strudel.json → Composition → Render
Extracted samples are registered in strudel.json with root notes.

Requires Python stack: uv init && uv add demucs librosa scikit-learn soundfile
src/runtime/
chunked-render.mjs — Chunked offline renderer (avoids OOM on long pieces)
offline-render-v2.mjs — Core offline renderer
smoke-test.mjs — 12-point smoke test
scripts/
download-samples.sh — Download dirt-samples (idempotent)
samples-manage.sh — Sample pack manager
vc-play.mjs — Stream audio to Discord VC
samples/ — Sample packs + strudel.json (gitignored)
assets/compositions/ — 15 original compositions
src/compositions/ — Audio deconstructions
references/ — Mood trees, techniques, architecture
docs/
KNOWN-PITFALLS.md — Critical composition pitfalls
ONBOARDING.md — Machine-actor onboarding guide
Uses node-web-audio-api (Rust-based Web Audio for Node.js). No browser, no Puppeteer.
The renderer calls setStringParser(mini.mini) after import because Strudel's npm dist bundles duplicate the Pattern class across modules — the mini notation parser registers on a different copy than the one used by note() and s().
All synthesis is local and offline via OfflineAudioContext: oscillators, biquad filters, ADSR envelopes, AudioBufferSourceNode for samples, dynamics compression, stereo panning. Output: 16-bit stereo WAV at 44.1kHz.
| Platform | Issue | Workaround |
|---|---|---|
| ARM64 (all) | PyTorch CPU-only, no CUDA | Expected — Demucs runs ~0.25× realtime |
| ARM64 (all) | torchaudio.save() fails | Patch demucs/audio.py to use soundfile.write() (see First-Time Setup) |
| ARM64 (all) | torchcodec build fails | Not needed — skip it, Demucs works without it |
| WSL2 | Discord VC silent (NAT blocks UDP) | Enable mirrored networking in .wslconfig |
| All | Strudel mini parser not registered | Renderer calls setStringParser(mini.mini) — already handled |
Strudel compositions are JavaScript files executed by Node.js. They have the same access as any Node.js script:
For untrusted compositions:
For your own compositions: No special precautions needed — you wrote the code.
This is the same trust model as any programming language skill. The renderer itself is safe; the risk is in what compositions you choose to run.
This skill uses OpenClaw's built-in Discord voice channel support for streaming. No separate BOT_TOKEN, DISCORD_TOKEN, or any Discord credentials are required. OpenClaw handles all Discord authentication and connection management. The skill simply produces audio files and hands them to OpenClaw's voice subsystem.
package.json contains no postinstall, preinstall, or lifecycle hooks. npm run setup runs npm install + scripts/download-samples.sh (downloads CC0 sample packs from known URLs).
What scripts/download-samples.sh fetches

The download script sparse-clones tidalcycles/Dirt-Samples from GitHub (CC-licensed) — specifically these directories: bd sd hh oh cp cr ride rim mt lt ht cb 808bd 808sd 808hc 808oh. This fetches ~153 WAV files (~11MB total). The script is idempotent (skips if samples already exist).
What scripts/samples-manage.sh does

The sample manager downloads additional packs from user-specified URLs with safety controls:
- STRUDEL_MAX_DOWNLOAD_MB (default: 10GB)
- STRUDEL_ALLOWED_HOSTS (comma-separated; empty = allow all)

Only one render should be active per session at a time. If a user requests /strudel clone while a previous render is in progress, check for an active render with subagents(action=list) before starting a new one.

Why: Concurrent renders with default output paths both write to output.wav, causing the second to overwrite the first. Even with explicit paths, two simultaneous OfflineAudioContext processes double memory usage. Sample loading is per-process (no shared cache), so there's no corruption risk — but disk I/O contention on the output write is real.
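Outside of sub-agent tracking, a lock directory is one way to enforce the one-render rule. A minimal sketch — the lock path is an assumption of this sketch, not part of the skill:

```shell
# mkdir is atomic, so it doubles as a test-and-set lock.
LOCK="${TMPDIR:-/tmp}/strudel-render.lock"
if mkdir "$LOCK" 2>/dev/null; then
  lock_state="acquired"
  # ...render would run here, e.g. bash scripts/dispatch.sh render ...
  rmdir "$LOCK"       # release when the render finishes (or trap EXIT)
else
  lock_state="busy"   # another render owns the lock; queue or refuse
fi
echo "render lock: $lock_state"
```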