Use when starting a new feature, writing a spec, or brainstorming architecture and you want to capture richer intent than typing allows. Speak your thoughts into a transcription tool, then feed the raw transcript to Claude Code for structuring.
Use speech-to-text tools to dictate your initial specs, feature ideas, and architectural thoughts instead of typing them. Speaking naturally produces longer, more context-rich input because people self-edit heavily when typing but ramble freely when talking. That rambling is gold — LLMs are excellent at extracting structured meaning from unstructured speech.
Core principle: Speaking freely captures intent that is hard to express in typed text. Don't self-edit — ramble and let the LLM find the structure.
Dependency: A speech-to-text tool (Wispr Flow, macOS Dictation, or similar).
| Mistake | Why it's wrong |
|---|---|
| Editing yourself while speaking | The whole point is to capture raw, unfiltered intent. Self-editing while speaking defeats the purpose: you reintroduce the same filtering that makes typed input thin. Just talk. |
| Skipping the transcription review | Speech-to-text makes errors. Quickly scan the transcript for mangled names, technical terms, or homophones before pasting it in. A 10-second scan prevents confused output. |
| Using voice during implementation | Voice shines during planning and ideation. Once you are writing code, typed instructions are more precise. Don't force voice where typing is better. |
| Pasting the transcript without a framing prompt | Claude Code needs to know what to do with the wall of text. Always prepend a short instruction like "Structure this into a feature spec" or "Extract the requirements from this transcript." |
| Speaking in short, clipped sentences | You are not typing. Speak in full, natural paragraphs. Explain the why, the context, the constraints, the edge cases. Longer is better — the LLM will compress it. |
Pick one and install it:
| Tool | Platform | Notes |
|---|---|---|
| Wispr Flow | macOS, Windows, iOS, Android | AI-powered voice-to-text with auto-editing. Its maker claims roughly 4x typing speed. Recommended. |
| macOS Dictation | macOS | Built-in. Press Fn Fn (or Globe key twice) in any text field. Good enough for most use cases. |
| Superwhisper | macOS | Polished Whisper app with hotkey activation. |
| Windows Voice Typing | Windows | Press Win+H. Built-in, decent quality. |
| Google Docs Voice Typing | Browser | Tools > Voice typing. Works well, requires Chrome. |
Any tool that converts speech to editable text works. The key requirement is that you can paste the output into a terminal or editor.
Open your transcription tool and start talking. Cover:

- What you want to build and why it matters
- The context: where it fits in the existing system
- Constraints and non-negotiables
- Edge cases and open questions
Do not organize your thoughts first. Do not outline. Just talk. Aim for 1-3 minutes of continuous speech. This typically produces 200-500 words of transcript, which is far more context than most people would type.
Speech-to-text tools mangle technical terms. Before pasting, scan for:

- Mangled product, library, or person names
- Technical terms replaced with near-homophones (e.g. "cash" for "cache")
- Numbers, versions, or acronyms transcribed as words
Fix only the obviously wrong terms. Do not rewrite — the raw, spoken style is the point.
Tip: Use tmux with Vim keybindings to quickly jump through the transcript and fix transcription errors before pasting. Vim's word motions (w, b) and change commands (cw) make surgical fixes fast.
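To speed up that 10-second scan, you can grep the transcript against a list of mis-transcriptions you see often. A minimal sketch, assuming the file names and the example phrases (which are hypothetical placeholders, not a fixed mapping):

```shell
# Build a list of phrases speech-to-text commonly produces for your
# project's jargon. These examples are illustrative assumptions;
# substitute the mangled forms of your own technical terms.
cat > mangled-terms.txt <<'EOF'
post dress
web socket
oh auth
EOF

# Hypothetical transcript file; in practice, paste your dictation here.
printf 'Store sessions in post dress and push updates over a web socket.\n' > transcript.txt

# Print any transcript line containing a suspect phrase, so you know
# exactly where to make surgical fixes before pasting.
grep -i -f mangled-terms.txt transcript.txt
```

Keep `mangled-terms.txt` around between sessions; the same terms get mangled the same way every time.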
Prepend a one-line instruction that tells Claude Code what to produce:
Structure this spoken transcript into a feature spec with requirements,
constraints, and open questions:
[paste transcript here]
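If your transcription tool writes to a file, you can prepend the framing line and hand the result to Claude Code without pasting by hand. A sketch, assuming a `transcript.txt` with hypothetical contents (the `claude -p` print mode runs Claude Code non-interactively):

```shell
# Hypothetical transcript file produced by your speech-to-text tool.
printf 'So the rough idea is we let users export their data to CSV...\n' > transcript.txt

# The framing instruction tells Claude Code what to produce.
FRAMING='Structure this spoken transcript into a feature spec with requirements, constraints, and open questions:'

# Prepend the framing line, then the raw transcript.
{ printf '%s\n\n' "$FRAMING"; cat transcript.txt; } > prompt.txt

# Feed it to Claude Code in non-interactive (print) mode:
# claude -p "$(cat prompt.txt)"
cat prompt.txt
```

The same wrapper works for any of the framing prompts below; only the `FRAMING` line changes.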
Other useful framing prompts:

- "Extract the requirements and open questions from this transcript."
- "Turn this rambling into a prioritized task list."
- "Summarize the architecture decisions and constraints described here."
Claude Code will return a well-organized version of your rambling thoughts. Now you can:

- Review the structured spec and correct anything it misread
- Answer the open questions it surfaced
- Hand the spec back to Claude Code as the starting point for implementation
This is where the real leverage appears: you went from a blank prompt to a structured spec in under 5 minutes, and it contains context you never would have typed.
| Item | Details |
|---|---|
| Recommended tool (macOS) | Wispr Flow or macOS Dictation (Fn Fn) |
| Recommended tool (Windows) | Windows Voice Typing (Win+H) |
| Ideal speaking length | 1-3 minutes (~200-500 words of transcript) |
| Best stage to use | Planning, ideation, spec writing |
| Worst stage to use | Active implementation, precise code edits |
| Transcript editing | tmux + Vim mode for quick surgical fixes |
| Key framing prompt | "Structure this spoken transcript into a feature spec with requirements, constraints, and open questions:" |
Based on Josh's technique from the Coding Agents: AI Driven Dev Conference. Josh uses Wispr Flow to speak initial specs, noting that speaking gives more context than typing because people naturally self-edit when they type. He also recommends tmux with Vim mode for quickly fixing transcription errors before feeding the text to an LLM.