Transcribe: 30 languages + 22 Chinese dialects
Translate: 38 languages, fully offline
Timestamp: make your own subtitles
MCP server: talk to it from your AI stack
Runs locally, even on laptops · No subscription · 14-day free trial
Every word you speak is processed on your device. No audio, text, or metadata is ever transmitted to any server. There is no cloud backend, no telemetry, no analytics, and no phone-home.
Brethof Voice Pro uses a GGUF-optimized engine built on llama.cpp for fast local inference. It supports all three major GPU vendors out of the box.
Powered by Qwen3-ASR via llama.cpp. Lock to a specific language for maximum accuracy, or let the engine auto-detect. Every word stays on your machine.
Plus 22 Chinese regional dialects (Anhui, Dongbei, Fujian, Henan, Hunan, Shandong, Sichuan, Wu, Minnan, and more) recognised automatically when the language is set to Chinese or auto-detect.
Translate any transcription, voice-keyboard output, plain text, or subtitle file entirely on your machine. Powered by Tencent Hunyuan MT2. On FLORES-200 (XCOMET-XXL), the Quality tier reaches 97.9% of Google Gemini 3.1 Pro's score and the compact Fast tier reaches 89.9%. Hunyuan MT2 also surpasses Gemini 3.1 Pro on real-world (WildMTBench) and minority-language translation.
Pick the balance of accuracy, speed, and VRAM that suits your machine. Both models use the same Qwen3-ASR architecture; switch at any time from Settings → Models.
Optional add-ons download on demand from Settings → Models: Forced Aligner (~540 MB) for word-level timestamps, Hunyuan MT2 Fast (~1 GB) or Quality (~4.3 GB) for translation.
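For readers wondering how word-level timestamps become a subtitle file, here is a minimal Python sketch. It is not Brethof Voice's own code: the `Word` record, the `max_chars` and `max_gap` thresholds, and the grouping rule are all assumptions made for illustration. Only the SRT cue layout (index, `HH:MM:SS,mmm --> HH:MM:SS,mmm`, text, blank line) is standard.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """Hypothetical aligner output: one word with start/end times in seconds."""
    text: str
    start: float
    end: float


def srt_time(seconds: float) -> str:
    """Format seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words, max_chars=42, max_gap=0.8):
    """Group words into cues, starting a new cue on a long pause or a long line."""
    cues, current = [], []
    for w in words:
        if current:
            line_len = len(" ".join(x.text for x in current)) + 1 + len(w.text)
            if w.start - current[-1].end > max_gap or line_len > max_chars:
                cues.append(current)
                current = []
        current.append(w)
    if current:
        cues.append(current)

    blocks = []
    for i, cue in enumerate(cues, start=1):
        text = " ".join(x.text for x in cue)
        blocks.append(
            f"{i}\n{srt_time(cue[0].start)} --> {srt_time(cue[-1].end)}\n{text}\n"
        )
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [Word("Hello", 0.0, 0.4), Word("world", 0.5, 0.9), Word("again", 2.5, 3.0)]
    print(words_to_srt(demo))
```

The pause threshold keeps a cue from spanning silence, and the character limit keeps each line readable on screen. Both values are arbitrary starting points.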
Optional DeepFilter noise suppression for recordings made in noisy rooms — off by default, enable from the Noise popup. Skipping it on clean mic clips actually helps quality (DeepFilter can over-process short, clean audio).
Fine-tune the model on your own voice with LoRA — runs end-to-end on your machine. Every time you correct a misrecognised word, the {clip, correction} pair is saved to your local training dataset. The main window's training card shows total samples and minutes captured at a glance — click it to open the dataset browser, then "Start training" in the Training tab.
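To make the {clip, correction} idea concrete, here is a small Python sketch of how such pairs could be stored and summarised. The on-disk format is not documented here, so the JSON Lines file, the `CorrectionSample` fields, and both helper functions are hypothetical. The summary mirrors what the training card reports: total samples and minutes captured.

```python
import json
from dataclasses import dataclass, asdict
from pathlib import Path


@dataclass
class CorrectionSample:
    """Hypothetical record: an audio clip plus the user's corrected text."""
    clip_path: str
    correction: str
    duration_s: float


def append_sample(dataset: Path, sample: CorrectionSample) -> None:
    """Append one sample as a JSON line; the file never leaves the machine."""
    with dataset.open("a", encoding="utf-8") as f:
        f.write(json.dumps(asdict(sample), ensure_ascii=False) + "\n")


def summarize(dataset: Path):
    """Return (total samples, total minutes) for the dataset file."""
    count, seconds = 0, 0.0
    if dataset.exists():
        with dataset.open(encoding="utf-8") as f:
            for line in f:
                if line.strip():
                    count += 1
                    seconds += json.loads(line)["duration_s"]
    return count, seconds / 60


if __name__ == "__main__":
    path = Path("corrections.jsonl")  # assumed file name
    append_sample(path, CorrectionSample("clips/0001.wav", "VFIO passthrough", 3.0))
    print(summarize(path))
```

An append-only file is a simple choice for this kind of log: each correction is one write, and a partial dataset is still usable for training.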
Hold the hotkey, speak, and the text lands wherever your cursor is — like a keyboard. Works in browsers, IDEs, terminals, chat apps, anywhere a text field accepts keyboard input.
EN: … || PL: …), or first target only.
One field, two uses. Bias the ASR toward proper nouns, brand names, and jargon to reduce errors such as "VFIO" being mistranscribed as "VEAF1". The same field doubles as the translation terminology dictionary: pin "Brethof Voice" to stay "Brethof Voice" in every target language.
The same binary that runs the GUI can run as a Model Context Protocol server — 19 tools exposing ASR and MT to Claude Desktop, Claude Code, Cursor, Cline, or any MCP-compatible agent. Transport is stdio: no port, no firewall, no localhost binding. The agent owns the lifecycle.
Run brethof-voice --mcp and the agent connects over stdio. Paid licence required — trial users can't start the server.
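As a rough illustration of the stdio setup, the Python sketch below builds two things: a Claude Desktop-style `mcpServers` entry that launches `brethof-voice --mcp`, and the first JSON-RPC message an MCP client sends over stdin. The server label `brethof-voice`, the client name, and the protocol version string are assumptions. The sketch also assumes the binary is on your PATH and that a paid licence is installed.

```python
import json


def claude_desktop_entry(command: str = "brethof-voice") -> dict:
    """Build an `mcpServers` entry; the key "brethof-voice" is just a label."""
    return {"mcpServers": {"brethof-voice": {"command": command, "args": ["--mcp"]}}}


def initialize_request(request_id: int = 1) -> str:
    """Build the MCP `initialize` request as one newline-terminated JSON line.

    Over stdio, MCP messages are JSON-RPC 2.0 objects, one per line.
    """
    message = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "initialize",
        "params": {
            "protocolVersion": "2024-11-05",  # assumed; use what your client supports
            "capabilities": {},
            "clientInfo": {"name": "example-client", "version": "0.1.0"},  # hypothetical
        },
    }
    return json.dumps(message) + "\n"


if __name__ == "__main__":
    # Paste this object into your agent's MCP configuration file.
    print(json.dumps(claude_desktop_entry(), indent=2))
    # An agent would write this line to the server's stdin after spawning it.
    print(initialize_request(), end="")
```

In practice the agent handles the handshake for you, so only the configuration entry is needed. The `initialize` line is shown to make clear that nothing but the process's stdin and stdout is involved.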
14-day free trial. All features unlocked. No credit card.
Local speech-to-text that learns your voice. Perpetual licence. Our flagship.
PAID · flagship
Local long-term memory for Claude Code — full-text + vector + graph, on SurrealDB. MIT.
FREE · open source
Print-ready digital models. STL/3MF/OBJ included. Lifetime access.
PAID · digital catalog
Our printed designs, shipped across Europe. Buy the object, not the file.
PAID · physical objects
Cyber-tiger AI host. Privacy-first AI explained without the corporate filter.
CHANNEL · live
Curated GitHub lists for AI, MCP, local AI, Linux for AI, and more. Receipts, not vibes.
FREE · curated
Long-form how-tos for local AI on Linux, Windows, macOS. Real configs, not marketing.
FREE · coming soon
Production-tested ComfyUI graphs — LTX chunked-loop, the Nova pipeline, and more.
FREE · workflows landing
Negative-curation: practices and tools that waste your time, ranked. Receipts required.
FREE · coming soon
Who we are, why we build local-first AI, and what we won't do.