Speech to Text — Whisper (local)

Runs OpenAI's Whisper model entirely in your browser. The model is downloaded once; your audio never leaves the device.

Truly local. Audio is processed on this page using transformers.js running ONNX models in WebAssembly + WebGPU. No audio is sent to any server. The model itself is downloaded once from Hugging Face's CDN, then cached by your browser.
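A minimal sketch of how such a setup might load the model, assuming a WebGPU-first policy with a WebAssembly fallback (this page does not state how it chooses between the two). The names `pickDevice`, `loadTranscriber`, `PipelineFactory`, and the default model id `Xenova/whisper-tiny.en` are illustrative assumptions, not taken from this page. The pipeline factory is passed in as a parameter so the sketch stays self-contained.

```typescript
type Device = "webgpu" | "wasm";

// Shape of a loaded speech recognizer: mono Float32 audio in, text out.
type Transcriber = (audio: Float32Array) => Promise<{ text: string }>;

// Shape assumed for the pipeline factory (hypothetical, simplified typing).
type PipelineFactory = (
  task: string,
  model: string,
  options: { device: Device },
) => Promise<Transcriber>;

// Assumption: prefer WebGPU when the browser exposes `navigator.gpu`,
// otherwise fall back to the WebAssembly backend.
function pickDevice(nav: { gpu?: unknown }): Device {
  return nav.gpu !== undefined ? "webgpu" : "wasm";
}

// Load the recognizer once. The model files are fetched on first use;
// later loads can be served from the browser cache.
async function loadTranscriber(
  pipeline: PipelineFactory,
  nav: { gpu?: unknown },
  model: string = "Xenova/whisper-tiny.en", // illustrative model id
): Promise<Transcriber> {
  return pipeline("automatic-speech-recognition", model, {
    device: pickDevice(nav),
  });
}
```

In a browser, the `pipeline` export of `@huggingface/transformers` and the global `navigator` would be the natural arguments; a type cast may be needed because the library's own typings are broader than `PipelineFactory`.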

1. Pick a model and load it

not loaded

2. Listen

idle
mic level: silent
queue: 0

Transcript

Transcript will appear here. Each chunk arrives 1–4 s after the speaker pauses.
0 words
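The pause-driven chunking described above can be sketched as follows. This is an assumed approach, not the page's actual code: it measures each audio frame's RMS level, treats frames below a threshold as silent, and queues a chunk for transcription once enough consecutive silent frames follow speech. `rms`, `PauseChunker`, and the default threshold and frame counts are hypothetical.

```typescript
// Root-mean-square level of one audio frame (samples in [-1, 1]).
function rms(frame: Float32Array): number {
  if (frame.length === 0) return 0;
  let sum = 0;
  for (let i = 0; i < frame.length; i++) sum += frame[i] * frame[i];
  return Math.sqrt(sum / frame.length);
}

class PauseChunker {
  // Finished chunks waiting to be transcribed.
  readonly queue: Float32Array[] = [];
  private frames: Float32Array[] = [];
  private silentRun = 0;
  private hasSpeech = false;
  private threshold: number;
  private pauseFrames: number;

  // threshold: RMS level below which a frame counts as silent (assumed value).
  // pauseFrames: consecutive silent frames that end a chunk (assumed value).
  constructor(threshold: number = 0.01, pauseFrames: number = 3) {
    this.threshold = threshold;
    this.pauseFrames = pauseFrames;
  }

  push(frame: Float32Array): void {
    const silent = rms(frame) < this.threshold;
    if (!silent) {
      this.hasSpeech = true;
      this.silentRun = 0;
      this.frames.push(frame);
      return;
    }
    if (!this.hasSpeech) return; // ignore silence before any speech
    this.silentRun += 1;
    this.frames.push(frame); // keep trailing silence inside the chunk
    if (this.silentRun >= this.pauseFrames) this.flush();
  }

  // Concatenate buffered frames into one chunk and reset the state.
  private flush(): void {
    let total = 0;
    for (const f of this.frames) total += f.length;
    const chunk = new Float32Array(total);
    let offset = 0;
    for (const f of this.frames) {
      chunk.set(f, offset);
      offset += f.length;
    }
    this.queue.push(chunk);
    this.frames = [];
    this.silentRun = 0;
    this.hasSpeech = false;
  }
}
```

Under these assumptions, the length of `queue` would correspond to the queue counter shown under step 2, and the delay before a chunk's text appears would be the pause length plus the model's inference time.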

How this works