Voice dictation on Windows 11 (Win+H) works, but it has three problems that get in the way of serious use:
- Average accuracy. Proper names, slang, technical terms — it misses often and there’s no easy way to correct.
- Partial cloud dependency. Some modes send audio to Microsoft’s servers.
- No per-app hotkey. Only works with Windows’ popup, which steals focus and breaks flow.
If you write a lot — email, Slack, Obsidian, scripts, code — better dictation pays for itself in hours saved per week. Here’s the setup with VoxBooster in 2026.
What changes with local, neural dictation
VoxBooster uses Whisper-derived models running locally on your PC. Practical differences:
- High enough accuracy you don’t need to edit much. Accepts punctuation by command (“comma,” “period,” “new line”).
- 100% local. Nothing goes to a server. Important for anyone dictating sensitive documents, proprietary code or professional conversation.
- Global hotkey. Hold
Ctrl+Shift+Din any app — Outlook, VS Code, browser, WhatsApp Web — speak, release, the text appears in the field where the cursor is.
Setup
- In VoxBooster, go to Dictation and enable.
- Pick a model: Fast (200 ms latency, good accuracy) or Precise (500 ms latency, high accuracy). For most uses, Fast is enough.
- Set the primary language and optional secondary — useful if you mix technical English into your speech.
- Set the global hotkey. Default:
Ctrl+Shift+D. - Done. Open any app, place the cursor, hold the hotkey, speak, release.
Modes of use
Push-to-dictate. Hold the hotkey, speak, release. Text appears immediately. This is the mode for 80% of cases.
Toggle. Press the hotkey once to turn on, again to turn off. Useful for long dictation (script, big email) where you don’t want to hold a key down.
Always on. Passive recognition waiting for a voice trigger (“Vox, write…”). Useful for hands-free flow like driving or with RSI.
Useful commands
- “comma” →
, - “period” / “full stop” →
. - “question mark” →
? - “exclamation mark” →
! - “new line” → line break
- “new paragraph” → two line breaks
- “colon” →
: - “open paren / close paren” →
(/) - “delete last word” → backspace the word
- “delete everything” → clears the current dictation buffer
Use cases that work well
- Email in volume. Dictating a draft is 3x faster than typing. 30-second review at the end.
- Personal notes in Obsidian / Notion. Capturing an idea at the rhythm it hits your head, not limited by typing speed.
- Replying on Slack / WhatsApp Web. On a long reply, dictating with a global hotkey is faster than switching to phone and using phone dictation.
- Audio messages in text form. Record the idea speaking, get it in text, quick edit. Respects the reader’s time better than audio.
- Scripts, blog posts, documentation. First draft dictated is 2-4x faster than typed. Text edit later, normally.
Cases where dictation doesn’t win
- Code. Variable names, syntax, braces open/close — dictating code is slower than typing with autocomplete. Exception: long comment in a function, which dictates well.
- Tables / spreadsheets. Grid structure doesn’t match voice flow.
- Noisy environment. Room with loud AC, café, coworking — accuracy drops. Better back to typing.
Privacy
The model runs locally, audio never leaves your PC, no dictation log is saved by default. If you want a log — to review lost drafts — enable “Keep history” in settings; files go to a local folder only you can access, encrypted by Windows.
That matters for anyone dictating sensitive conversation. Windows native dictation, even in offline mode, shares device telemetry. VoxBooster doesn’t — no telemetry at all in the dictation module.