Professional voice-to-text that runs entirely on your machine. No cloud, no latency, no compromise.
Every word you speak is processed on your device. No audio, text, or metadata is ever transmitted to any server. There is no cloud backend, no telemetry, no analytics, and no phone-home.
Brethof Voice Pro uses the GGUF-optimized engine with llama.cpp for blazing-fast inference. Supports all three major GPU vendors out of the box.
Speak in any supported language and the engine auto-detects it. Or lock to a specific language for maximum accuracy. All processing happens locally.
Choose the perfect balance of accuracy, speed, and resource usage. All tiers use the same Qwen3-ASR architecture with different encoder precision and model sizes.
Switch between tiers at any time from the app settings. No re-download needed — all tiers ship with the installer.
Built-in DeepFilter noise suppression processes your microphone input in real-time before transcription. Background noise, keyboard sounds, and room echo are removed automatically.
Fine-tune the model on your own voice using LoRA (Low-Rank Adaptation) to improve recognition of your accent, language, and speaking style. Not available in v1.0 — we disabled training when we moved the inference engine to GGUF, and we are waiting to ship it until the PyTorch-to-GGUF conversion pipeline is solid. When it lands it will be free for anyone with a Personal or Business license, and all training will run on your machine.
Transcribed text is typed directly into whatever application has focus. No copy-paste, no clipboard. It works like a keyboard — press your hotkey, speak, and the text appears.
Provide a list of domain-specific terms, names, or jargon to bias the model toward correct recognition. Ideal for technical dictation, medical notes, or any specialized vocabulary.
14-day free trial. All features unlocked. No credit card.