Updates — Brethof Voice Pro

Release May 21, 2026

v2.0.0 — Offline translation in 38 languages

Brethof Voice Pro is no longer just voice-to-text. v2.0.0 ships offline translation powered by Tencent Hunyuan MT2 — on FLORES-200 (XCOMET-XXL) the 7B tier reaches 97.9% of Google Gemini 3.1 Pro, and it surpasses Gemini on real-world and minority-language tests. Translation runs entirely on your machine. Two model tiers, downloaded on demand: Fast (~1 GB, sub-second on CPU or GPU) and Quality (~4.3 GB, sub-second on GPU). Plus several long-awaited additions.

What is new in v2.0.0:

Transcribe + translate — the Transcribe popup has a "Translate to:" dropdown. ASR transcribes, MT translates, both render side-by-side. Works on files, mic, and system audio (loopback capture of meeting calls, browser tabs, anything playing on your speakers).
Multilingual voice keyboard — new translation chip on the main row opens a 3-column language grid. Pick one or more targets, the keyboard types the translation. Layout modes: one per line, inline (EN: … || PL: …), or first target only.
SRT / VTT subtitle translator — preserves timings + cue indices, supports bilingual mode (source line + translation per cue). Exposed via MCP for AI-agent automation.
Multi-GPU per engine — Settings → Models has an independent device picker for each engine. Run ASR on Vulkan 0 and translation on Vulkan 1, or pin Hunyuan MT2 7B to CPU on VRAM-tight laptops.
Personal voice training is fully live — LoRA fine-tuning on your own voice, end-to-end on your machine. The main window's training card shows total samples and minutes captured at a glance; "Start training" in the Training tab auto-picks NVIDIA CUDA or CPU, then auto-exports the trained model to GGUF. Free with every paid licence.
4 new MCP tools — translate_text, translate_srt, list_compute_devices, set_compute_device. Total tool count now 19.
Honest language splits — ASR: 30 selectable languages + auto-detect + 22 Chinese dialects (52 total). Translation: 38 languages (33 main + 5 Chinese-area minority). 23 work in both directions.

Linux binary is 161 MB, Windows installer is 118 MB. Same launch prices: $49 personal, $149 business. Existing licences carry over — just download v2.0.0 and the translation models will appear in Settings → Models.

Download v2.0.0 →

Release May 20, 2026

v1.6.16 — Personal voice training is live

The training pipeline shipped. LoRA fine-tuning on your own voice now runs end-to-end on your machine — the app auto-picks NVIDIA CUDA or CPU, then auto-exports the trained model to GGUF when done. Every correction you make in the GUI is auto-saved to your local training dataset; the main window's training card shows total samples and minutes at a glance.

Bonus: voice-keyboard accuracy improved across all languages thanks to a llama.cpp upgrade (build b9222) that fixed a chunk-boundary collapse on long clips. Free with every paid licence.

Engineering May 15, 2026

v1.6.0 — Async transcription jobs + word-level SRT polish

Two new MCP tools land: start_transcription returns a job ID instantly so the agent can do other work, and get_transcription_status polls for completion. One job at a time, result inlined when done. Long files no longer block the agent loop.

Plus a chain of fixes to word-level SRT/VTT output: no more stranded spaces before punctuation, no more lone-dot cues, no more hotword/context strings leaking into the transcript. Cleaner subtitles, no manual cleanup needed.

Engineering May 15, 2026

v1.5.0 — Killed ONNX, runtime is now pure llama.cpp + libmtmd

Full engine rewrite. Brethof Voice Pro now runs Qwen3-ASR end-to-end on llama.cpp with GGUF-quantised weights via libmtmd. ONNX Runtime is gone. The result: smaller install (~83 MB binary, down from 400+ MB), faster cold-start, and no more fighting per-platform CUDA/DirectML wheels.

Vulkan picks up your GPU automatically — NVIDIA, AMD, or Intel Arc — with a CPU fallback when no GPU is present. The same engine now powers every downstream feature: ASR, voice keyboard, the MCP server, and (now in v2.0.0) translation.

Engineering May 14, 2026

v1.4.0 — MCP server debut + multi-GPU device picker

The Model Context Protocol server lands. Any MCP-compatible AI agent — Claude Desktop, Claude Code, Cursor, Cline — can drive transcription over stdio (no port, no firewall). Same release ships a multi-GPU device selector so you can pick which Vulkan GPU runs ASR, plus the optional Forced Aligner add-on for word-level timestamps on every transcription.

Paid-tier only — the MCP server refuses to launch without a Personal or Business licence. brethof-voice --mcp is the one-line invocation.

Roadmap Apr 16, 2026

macOS version in development — Apple Silicon first

The most-asked question before launch was "when macOS?". Answer: in active development. Apple Silicon native build with Metal acceleration comes first, Intel follows. Target Q3 2026 — and we are opening a closed beta in Q2. Here is what is being built and how to sign up.

Read post →

Product Apr 16, 2026

Personal model training — fine-tune Brethof Voice Pro on your own voice

Personal voice training is live in Voice Pro. Every time you correct a misrecognised word, the audio clip + correction is auto-saved to your local training dataset. One click in the Training tab fine-tunes a LoRA on your accent — the app picks NVIDIA CUDA or CPU automatically, then auto-exports the trained model to GGUF. Free with every paid licence.

Read post →

Announcement Apr 16, 2026

Brethof Voice Pro is live — voice to text that stays on your machine

After months of engineering, Voice Pro v1.0 ships today for Windows and Linux. 30 transcription languages plus 22 Chinese dialects, fully offline transcription, hotkey-anywhere dictation, and a one-time price with no subscription. Here is what made it into the launch build and what we are working on next.

Read post →

Champions Apr 16, 2026

1,800 free licenses up for grabs: the Champions Program is open

The Champions Program opens today and runs until May 16. Fifty free Personal licenses per supported language — 1,800 total — plus 70% off for every qualifier who does not land in the top 50. Here is how it works and why we are doing it this way.

Read post →

Licensing Apr 16, 2026

Personal license now covers solo business and freelance work

We listened to early feedback from writers, consultants, and translators who felt the old "personal use only" line was confusing. The Personal license at $49 now explicitly covers solo and freelance commercial use. Business license ($149/seat) is for teams. Here is what changed and why.

Read post →

Product Apr 12, 2026

30 transcription languages, fully translated UI — not a machine-translated wall of text

Voice Pro now transcribes and presents itself in 30 languages plus 22 Chinese dialects: Arabic, Cantonese, Chinese, Czech, Danish, Dutch, English, Filipino, Finnish, French, German, Greek, Hindi, Hungarian, Indonesian, Italian, Japanese, Korean, Macedonian, Malay, Persian, Polish, Portuguese, Romanian, Russian, Spanish, Swedish, Thai, Turkish, Vietnamese. Here is how the app picks the right one and what "fully translated" actually means.

Read post →

Engineering Apr 8, 2026

Why we moved from ONNX to GGUF — faster, smaller, easier to ship

Voice Pro's ASR backend runs on llama.cpp with GGUF-quantised Qwen models instead of ONNX Runtime. The result: a smaller install (83 MB exe vs 400+ MB), faster cold-start, and no more fighting with CUDA/DirectML wheels on every platform. Here is the engineering story behind the switch.

Read post →

Principles Apr 4, 2026

Why Brethof Voice Pro runs 100% offline — and why that is not just a feature

Every major dictation product sends your voice through someone else's server. We think that is the wrong default. Your voice is the most personal data you generate — medical notes, legal drafts, private journals, work secrets. Here is why Voice Pro has no cloud mode, no "optional telemetry", and no account requirement to transcribe.

Read post →

Get updates in your inbox

Create an account to be notified when we publish new posts and ship new versions.

Create Account

Product Updates

v2.0.0 — Offline translation in 38 languages

v1.6.16 — Personal voice training is live

v1.6.0 — Async transcription jobs + word-level SRT polish

v1.5.0 — Killed ONNX, runtime is now pure llama.cpp + libmtmd

v1.4.0 — MCP server debut + multi-GPU device picker

macOS version in development — Apple Silicon first

Personal model training — fine-tune Brethof Voice Pro on your own voice

Brethof Voice Pro is live — voice to text that stays on your machine

1,800 free licenses up for grabs: the Champions Program is open

Personal license now covers solo business and freelance work

30 transcription languages, fully translated UI — not a machine-translated wall of text

Why we moved from ONNX to GGUF — faster, smaller, easier to ship

Why Brethof Voice Pro runs 100% offline — and why that is not just a feature

Get updates in your inbox

Everything we build

Voice Pro

brethof-mind

3D Models

3D Prints

Nova

Awesome lists

Guides

ComfyUI workflows

Anti-dev tier list

About Brethof AI