技能檔案

Workflow — Audio Production

Name: Workflow — Audio Production
Author: damionrashford

Full-stack audio routing across macOS / Linux / Windows, MIDI + OSC control surfaces, DAW integration over JACK, DSP (EQ / compression / loudness / spatial / binaural HRTF), AI denoise (DeepFilterNet / RNNoise / Resemble Enhance), stem separation (Demucs), TTS, Whisper transcription, and EBU R128 / Spotify / Apple / ATSC loudness certification. Use when the user says "set up DAW", "route audio through OBS", "MIDI control surface", "binaural ASMR", "stem separation", "loudness to −16 LUFS", "JACK transport", or anything audio-stack-related.

damionrashford1 星標2026年4月18日

職業
分類: 安全

技能內容

What: Route, process, and finish audio end-to-end across every OS and every control surface.

Skills used

audio-routing-docs, audio-pipewire, audio-jack, audio-coreaudio, audio-wasapi, media-midi, media-osc, media-sox, ffmpeg-audio-filter, ffmpeg-audio-fx, ffmpeg-audio-spatial, media-denoise-ai, media-demucs, media-musicgen, media-tts-ai, media-whisper, media-ffmpeg-normalize, ffmpeg-captions, ffmpeg-subtitles.

Pipeline

Step 1 — Platform audio routing

OS	Skill	What to do
Linux

相關技能

Workflow — Audio Production | Skills Pool

BlackHole / Loopback appear as BOTH input AND output on macOS. Route the app to OUTPUT; pick input in DAW/OBS.
PipeWire aliases default to the current Pulse sink. Script with the real sink name from pw-cli ls Node.
JACK buffer = latency frames ÷ sample rate (128 @ 48 kHz = 2.67 ms). Too low = xruns; too high = sluggish.
WASAPI loopback captures the render endpoint, not the source app. Use a virtual cable for per-app capture.
macOS aggregate devices clock to the FIRST device in the list. Others drift. Pick the clock master first.
Windows MME is legacy shared; WASAPI is exclusive/professional. For low latency, use WASAPI.
MIDI 1.0 = 3 bytes; MIDI 2.0 UMP = 4–16 bytes. Wrong protocol = silent ignore.
MIDI channels: 0–15 on the wire, 1–16 in UI. Off-by-one is the #1 bug.
MIDI running status compresses repeated types — old hardware uses it; modern mostly doesn't.
OSC bundles contain timed sub-messages. Parse type:bundle → .elements branch.
OSC address patterns support wildcards (/track/*/volume). Server-side wildcards fire multiple handlers.
OSC TCP 1.1 vs UDP 1.0. Pick one per endpoint; they're not interchangeable.
loudnorm is NOT a limiter. It normalizes. If peaks exceed target, it applies downward gain, not transient shaving. Add explicit alimiter.
aresample filter_size trades quality for speed. 512 = audiophile, 128 = streaming.
atempo range is 0.5–2.0 per instance. Chain for extremes.
asetrate changes pitch AND tempo together. Combine with atempo for pitch-preserving speed.
sidechaincompress — key input FIRST, signal SECOND. Wrong order = wrong ducking.
DeepFilterNet: 48 kHz MONO or 16 kHz MONO. Resample before.
RNNoise: 48 kHz mono 16-bit, hardcoded. Anything else fails silently.
Demucs: 44.1 / 48 kHz STEREO. Mono → degraded stems.
Whisper: 16 kHz mono best. Resamples internally; explicit is cleaner.
Voice cloning needs a CLEAN reference. DeepFilterNet the reference sample first.
Kokoro TTS outputs 24 kHz. Resample to 48 kHz before mixing.
Single-pass loudnorm uses guardrails (±1 LUFS). Two-pass measures then applies exactly. Use media-ffmpeg-normalize for certification.
True peak measures inter-sample peaks, 4× oversampled. Standard sample peak misses these.
LUFS = LKFS = same thing (different standards' names for integrated loudness).

Target	Integrated	True peak
Spotify Master	−14 LUFS	−2 dBTP
Apple Podcasts	−16 LUFS	−1 dBTP
ATSC A/85	−24 LUFS	varies
EBU R128 broadcast	−23 LUFS	−1 dBTP
ACX audiobooks	−19 LUFS	−3 dBTP

Workflow — Audio Production

Skills used

Pipeline

Step 1 — Platform audio routing

Workflow — Audio Production

Skills used

Pipeline

Step 1 — Platform audio routing

Step 2 — Control surfaces

Step 3 — DSP chain

Step 4 — Spatial audio

Step 5 — AI processing

Step 6 — Loudness certification

Variants

Gotchas

Example — Podcast-style live monitor with MIDI transport

1password

Springboot Security

Security Review

Laravel Security

Security Review

Django Security

Workflow — Audio Production

Skills used

Pipeline

Step 1 — Platform audio routing

Workflow — Audio Production

Skills used

Pipeline

Step 1 — Platform audio routing

Step 2 — Control surfaces

Step 3 — DSP chain

Step 4 — Spatial audio

Step 5 — AI processing

Step 6 — Loudness certification

Variants

Gotchas

Example — Podcast-style live monitor with MIDI transport

Related

1password

Springboot Security

Security Review

Laravel Security

Security Review

Django Security