ModelsMar 3, 2026

Whisper Distil Large v3.5: Faster English Dictation on Mac

If you dictate in English all day — long documents, continuous notes, hours of voice input — transcription speed matters. Distil-Whisper Large v3.5 is a version of Whisper specifically trained to be faster on English while keeping accuracy within about 1% of the full Turbo model. Here is what it is, when to choose it, and how it compares to today's alternatives on Mac.

What knowledge distillation means in practice

Knowledge distillation is a technique where a smaller “student” model is trained to mimic a larger “teacher” model. The student learns from the teacher's outputs rather than from raw data alone, which means it can achieve comparable accuracy with fewer parameters.

Distil-Whisper Large v3.5 was trained this way on English data, distilling from Whisper Large V3. The result: it runs 1.5x faster than Whisper Turbo on the same hardware, with English word error rates within roughly 1% on short-form content. The download size is comparable at around 1 GB — the compression comes from the architecture, not the quantization.

On Apple Silicon, that 1.5x speed improvement is noticeable when processing longer recordings or transcribing back-to-back quickly.

The tradeoff: English only

Distil-Whisper is trained on English. It doesn't support other languages. If there's any chance you'll need to dictate in a second language, use Whisper Turbo instead. Distil-Whisper's speed advantage comes precisely from focusing the model on a single language, so multilingual use isn't a design goal here.

How to run it on Mac

Distil-Whisper is a research-grade model, not an app. To run it locally on Apple Silicon you need a runtime:

whisper.cpp — Georgi Gerganov's C++ port with Core ML and Metal acceleration. Compiles natively, runs fast on M-series chips, command-line driven.
MacWhisper — a Mac GUI for transcribing audio files, supports Distil-Whisper and other Whisper variants.
insanely-fast-whisper — a Python wrapper optimized for speed; works on Mac via MPS.

These cover file transcription. For live, hotkey-driven dictation into any text field, you need a different kind of tool.

Distil-Whisper vs. Parakeet for live English dictation

Worth comparing directly: NVIDIA's Parakeet TDT 0.6B v3 is an excellent English model and is smaller at 640 MB. On most English content, Parakeet's accuracy is competitive with or better than Distil-Whisper, with lower latency for short-form input — the kind of input live dictation produces.

Distil-Whisper makes sense if you specifically want Whisper-architecture transcription — for consistency with other Whisper-based systems, or because your content is served better by Whisper's training distribution. Parakeet tends to win on cold-start latency and pure-English WER.

A live dictation alternative

If what you actually want is fast English dictation on Mac — press a key, speak, clean text in any app — Resonant runs Parakeet locally on Apple Silicon's Neural Engine. Audio never leaves your Mac. No model picker; one model, tuned for live latency. Free to use.

If you specifically want Distil-Whisper for batch transcription, whisper.cpp or MacWhisper are the right path. If you want low-friction live English dictation, download Resonant.