macOS voice-to-text
Speak. Transcribe.
Enhance.
Voice input that thinks after it listens.
A glass pill on your desktop: press a hotkey, speak, and watch text land wherever your cursor is. Powered by Gladia for transcription, then routed through Claude, GPT, Gemini, or Ollama to clean, reformat, and paste text ready to use.
macOS 13+, Apple Silicon. Free while in beta.
> Ship the voice polish this week. We already have Gladia wired, just need the floating UI to feel invisible until you need it.
Features
Everything you need to dictate like a pro
Transcription is just the start. DD Voice layers AI post-processing, output presets, searchable history, and domain vocabulary on top.
Global Hotkey
Press Cmd+Shift+V from any app. No context switching. Start dictating the moment you need it.
AI Post-Processing
Route your transcript through Claude, GPT-4, Gemini, or a local Ollama model to clean, reformat, or restructure before it hits your clipboard.
Output Style Presets
Choose Clean, Notes, Email, or Code. Each preset tells the AI how to format your text so it lands ready to use.
Transcription History
Every recording is saved locally with full-text search. Find a phrase from last Tuesday, copy it, and move on.
Custom Vocabulary
Add domain-specific terms (Kubernetes, OAuth, your product names) so Gladia nails the hard words every time.
Speaker Diarization
Recording a meeting or pair session? DD Voice labels who said what so your transcript reads like a conversation.
Live Waveform
See your input level in real time. Know the mic is listening before you commit a long thought.
Instant Paste
Transcription drops right where your cursor is. Docs, Slack, email, VS Code. No copy-paste required.
How it works
Three steps. Zero friction.
Press a key, talk, and get polished text. The whole loop takes seconds.
- 1
Speak
Press the hotkey from any app. A tiny glass pill appears and starts recording. Talk at your normal pace.
- 2
Transcribe
Gladia processes your audio with pro-grade speech-to-text. Custom vocabulary and speaker labels are applied automatically.
- 3
Enhance
Your chosen AI model (Claude, GPT, Gemini, or Ollama) cleans and reformats the text using your selected output preset, then pastes it in place.
Output presets shape the result
Clean
Removes filler words and fixes grammar. The default for everyday dictation.
Notes
Restructures into bullet points and headings. Great for meetings and brainstorms.
Formats as a professional email draft with greeting and sign-off.
Code
Converts spoken descriptions into code comments or documentation blocks.
Compare
DD Voice vs. the rest
Most dictation tools stop at raw transcription. DD Voice keeps going.
| Feature | DD Voice | Whisper | Otter.ai | macOS Dictation |
|---|---|---|---|---|
| Global hotkey from any app | ||||
| AI post-processing (LLM) | ||||
| Output style presets | ||||
| Model choice (Claude/GPT/Gemini/Ollama) | ||||
| Speaker diarization | ||||
| Custom vocabulary | ||||
| Searchable history | ||||
| Works offline | ||||
| Auto-paste at cursor | ||||
| 10+ language support | ||||
| macOS native menu bar app | ||||
| Free to start |
Pricing
Simple pricing. Kicks in after beta.
Free while in beta. When it launches, pick a plan that matches your volume.
Starter
$5/month
- 5,000 words/month
- All languages
- Auto-paste at cursor
- Transcription history
- Custom vocabulary
Pro
$12/month
- 25,000 words/month
- AI post-processing (Claude, GPT, Gemini)
- Output style presets
- Speaker diarization
- Priority transcription
Unlimited
$24/month
- Unlimited words
- All Pro features
- Local Ollama support
- Custom system prompts
- Export to JSON
Download DD Voice
macOS 13+ and Apple Silicon. Free while in beta.