Voice to Text in Microsoft Word on Mac
3 Ways That Actually Work (2026)
Three options exist for using voice to text in Microsoft Word on Mac. They have meaningfully different requirements and trade-offs. This guide covers all three honestly — what each one needs, how to set it up, and where it falls short — so you can pick the right one for how you work.
Option 1: Word's built-in Dictate
Word for Mac has a Dictate button in the Home tab of the ribbon. It's the most obvious starting point and the one Microsoft promotes.
Requirements
- Microsoft 365 subscription— Dictate is not available on perpetual Office licenses (Office 2019, Office 2021, Office 2024). If you bought Word once and don't have a subscription, the button won't appear.
- Internet connection— Word streams audio to Microsoft's servers for transcription. It does not work offline.
- Microphone permission— Word needs Microphone access in System Settings > Privacy & Security > Microphone.
Setup
- Open System Settings > Privacy & Security > Microphone and toggle Microsoft Word on.
- Open a Word document.
- Click the Home tab in the ribbon.
- Click the Dictate button (microphone icon, usually in the Voice section on the right side of the ribbon).
- Wait for the red recording indicator to appear.
- Speak naturally — text appears as you talk.
- Click Dictate again or press Option (⌥) + F1 to stop.
What it's good at
Convenient if you already pay for Microsoft 365. No extra app to install. Works inside the Word UI with a floating toolbar for language and punctuation settings. Supports 30+ languages.
Where it falls short
- Subscription wall— no Microsoft 365, no Dictate.
- Cloud-only— audio leaves your Mac on every session.
- Session stability— drops on silence, network hiccups, or token refreshes. Users report it “stopping mid-sentence.”
- Manual punctuation— you need to say “comma,” “period,” “new line” out loud.
- Filler words stay— every “um” and “uh” is transcribed.
- Word-only— doesn't help you in the cover email, the Slack message, or the browser tab you switch to next.
- Enterprise blocks— IT departments in legal, healthcare, finance, and government often disable cloud dictation by policy.
Option 2: macOS system dictation
Every Mac has built-in dictation. It works in any text field — including Word — and doesn't require a Microsoft subscription.
Requirements
- macOS Ventura (13) or laterfor on-device processing of short sessions. Older macOS versions send audio to Apple's servers.
- Dictation enabledin System Settings > Keyboard > Dictation.
Setup
- Open System Settings > Keyboard.
- Scroll to Dictation and toggle it on.
- Choose your language and preferred shortcut (default: fn pressed twice).
- Click into a Word document.
- Press fn twice and speak.
- Press fn again or click Done to stop.
What it's good at
Free. No subscription. Works in any text field, not just Word. Basic auto-punctuation on macOS Ventura and later. Supports 60+ languages.
Where it falls short
- Short sessions only— Apple dictation stops after roughly 30–60 seconds of continuous speech. For anything longer than a sentence or two, you'll need to restart it repeatedly.
- Partial cloud processing— longer sessions or older macOS versions send audio to Apple for processing.
- Accuracy— acceptable for casual text, but struggles with domain vocabulary, legal terms, and proper nouns.
- No filler removal— transcribes everything verbatim.
- No formatting intelligence— no awareness of the app you're in or the tone the document needs.
Option 3: Resonant — system-wide dictation, fully on-device
Resonant is a Mac app built for the kind of dictation most people actually need: long-form, accurate, private, and system-wide. It removes the constraints of both Word's Dictate and macOS dictation.
Requirements
- macOS 14+ on Apple Silicon (M1 or later).
- No subscription— free to download and use.
- No internet— works fully offline.
Setup
- Download Resonant and move it to Applications.
- Grant Microphone and Accessibility permissions when prompted.
- Click into a Word document (or any other app).
- Press fn and speak.
- Release the key. Clean, punctuated text lands in the document.
What it's good at
- Long-form dictation— no session limits. Dictate for as long as you hold the hotkey.
- Automatic punctuation— neural models infer punctuation from the rhythm of your speech. No “comma,” “period,” “new line.”
- Filler word removal— “um,” “uh,” false starts, and restarts are cleaned automatically.
- Modern speech models— Parakeet, compiled to CoreML for Apple Neural Engine. Accurate on legal terms, medical vocabulary, proper nouns, and technical jargon.
- Fully offline— audio never leaves your Mac. Works on planes, in regulated environments, on bad Wi-Fi.
- System-wide— same hotkey works in Word, Outlook, Slack, Safari, Chrome, terminal, and every other app.
- Context-aware tone— adapts formatting for email, Slack, code, and formal documents.
Where it falls short
- Mac only— requires Apple Silicon (M1 or later). No Windows or web version yet.
- Separate app— not built into Word's ribbon. This is also the reason it works everywhere.
Side-by-side comparison
Here's how the three options compare on the things that matter for daily use.
| Feature | Word Dictate | macOS Dictation | Resonant |
|---|---|---|---|
| Cost | Microsoft 365 subscription (~$7–$13/mo) | Free (built into macOS) | Free |
| Internet required | Yes — always | Partially — short offline, longer sessions need network | No — fully offline |
| Audio processing | Microsoft’s cloud servers | On-device for short sessions, Apple servers for longer | On-device, always |
| Session length | Drops on silence, token refresh, or network wobble | ~30–60 seconds continuous, then stops | As long as you hold the hotkey |
| Automatic punctuation | Must say “comma,” “period,” “new line” | Basic auto-punctuation | Full neural punctuation from speech rhythm |
| Filler word removal | No — transcribes every “uh” and “um” | No | Yes — cleaned automatically |
| Works outside Word | Word and some Office apps only | Any text field on Mac | Any text field on Mac |
| Accuracy on names and jargon | Older cloud engine — struggles with domain terms | Moderate | Modern neural models (Parakeet) |
| Languages | 30+ | 60+ | 25+ on-device |
| Enterprise / regulated use | Often blocked by IT policy | Allowed but limited | Local-only — no cloud audio in your data flow |
Which one should you use?
If you already pay for Microsoft 365 and only need occasional short dictation inside Word: Word's built-in Dictate is fine. It works, it's convenient, and you don't need to install anything. Just know that it's cloud-only, drops sessions, and won't help you outside Office apps.
If you want something free and quick for a sentence or two: macOS dictation. Press fn twice and go. It's limited, but for a short burst in any app, it works.
If you dictate regularly, write long-form documents, care about accuracy, or work in a regulated environment: Resonant. It handles long sessions without dropping, cleans up your speech automatically, runs entirely on-device, and works in every app — not just Word. Most people who try all three options end up here.
Frequently asked questions
Can I use voice to text in Word on Mac without a Microsoft 365 subscription?
Yes. Word's built-in Dictate button requires a Microsoft 365 subscription, but macOS system dictation and Resonant can both type dictated text into Word without one. Resonant works with any version of Word — desktop, web, or perpetual license.
Does Word voice to text work offline on Mac?
No. Word's Dictate streams audio to Microsoft's servers. macOS dictation handles short sessions locally on Ventura and later. Resonant is the only option that works fully offline for any length of dictation.
Why is Word voice to text not working on my Mac?
The most common causes: no active Microsoft 365 subscription, no internet connection, microphone permission not granted to Word in System Settings, or an enterprise policy that blocks cloud speech services. If you can't resolve these, macOS dictation or Resonant bypass all of them.
How long can I dictate continuously?
Word's Dictate drops on silence and network changes — practical limit is a few sentences at a time. macOS dictation stops after 30–60 seconds. Resonant has no time limit; the session runs as long as you hold the hotkey.
Does my voice data stay private?
With Word's Dictate, audio goes to Microsoft's servers. macOS dictation may send audio to Apple for longer sessions. Resonant processes audio entirely on your Mac — nothing is uploaded, stored, or sent to any server.
What Resonant offers beyond dictation
Resonant isn't just a faster way to type. It's a voice workspace with capabilities no other dictation tool provides.
MCP server for AI tools
Resonant exposes 11 MCP tools that let any AI agent — Claude, Codex, and more — query your entire voice workspace — meetings, dictations, memos, ambient context, and daily journal. Your AI assistant knows what you said this morning. Learn more
Meeting transcription with speaker labels
Dual-channel recording — your mic and system audio on separate channels. NVIDIA Sortformer diarization identifies who said what. No bot joins the call. No audio leaves your Mac. Learn more
Ambient context capture
Passively records which apps you use, window titles, URLs, and dwell time — all locally. This makes dictation context-aware and gives your AI tools a queryable work timeline. Learn more
Two on-device speech models
NVIDIA Parakeet TDT v3 (0.6B, 25 languages) and Qwen3 ASR (0.6B, 30+ languages), both compiled to CoreML and running on Apple Neural Engine. Under 4% WER on English benchmarks. Learn more
Cloud cleanup with hallucination detection
Optional AI post-processing fixes STT errors and adapts to context (email, message, code). Guardrails detect when the LLM rewrites your meaning instead of cleaning your grammar. Learn more
Start with private Mac dictation
Local speech recognition is free and runs on your Mac. Pro adds cloud cleanup, rewrites, summaries, and sharing when you want the full workflow.