100% Local — Your voice never leaves your machine

Voice to Text That Never Leaves Your Computer

State-of-the-art AI transcription running entirely on your device. No cloud, no subscriptions, no data collection. Fast, accurate, and private.

No credit card required. Works offline after setup.

🔒 100% Local — No cloud ever
🌐 36 Languages auto-detected
GPU Accelerated — NVIDIA, AMD, Intel
💻 Linux & Windows
✈️ Works Offline

Everything you need, nothing you don't

Professional voice-to-text powered by the Qwen3-ASR engine with GGUF optimization.

🔒

Complete Privacy

Every word you speak is processed locally. Nothing is ever sent to a server. Your data stays on your machine, always.

Instant Transcription

GPU-accelerated inference with Vulkan for NVIDIA, AMD, and Intel GPUs. Your words appear the moment you finish speaking.

🌐

36 Languages

Automatic language detection across 36 languages. Speak in any supported language and the engine adapts instantly.

✈️

Fully Offline

Download the model once (1.0 – 3.2 GB depending on tier). No internet connection needed after setup.

🎓

Personal Training (coming Q3 2026)

Fine-tune the model on your voice and vocabulary using LoRA. Not in v1.0 — shipping once our PyTorch-to-GGUF conversion pipeline is ready. Free with Personal and Business licenses.

💰

Pay Once, Own Forever

Perpetual license. No monthly fees, no usage limits, no token counting. One purchase, lifetime access.

🎵

Built-in Noise Reduction

DeepFilter AI noise suppression removes background noise in real-time before transcription, improving accuracy in noisy environments.

⌨️

Direct Text Injection

Transcribed text is typed directly into any application. Works with any text field, code editor, or chat window.

📈

6 Quality Tiers

From Fast Light (1.0 GB) to Max Full (3.2 GB). Choose the perfect balance of quality, speed, and resource usage for your hardware.

Choose your quality level

Six tiers optimized for different hardware. All use the Qwen3-ASR architecture with GGUF quantized decoding.

Max Full

1.7B · FP32 encoder · Q5_K_M decoder
3.2 GB

Maximum accuracy. Zero quality compromise. Best for professional transcription work.

Max Balanced

1.7B · FP16 encoder · Q5_K_M decoder
2.6 GB

Near-identical quality at reduced size. Ideal for most users with a dedicated GPU.

Max Light

1.7B · INT8 encoder · Q5_K_M decoder
2.3 GB

Large model quality in a compact package. Great for systems with limited VRAM.

Fast Full

0.6B · FP32 encoder · Q5_K_M decoder
1.5 GB

Maximum accuracy from the compact model. Perfect when you want speed without quality loss.

Fast Balanced

1.2 GB

The default tier. Excellent quality for its size.

Fast Light

0.6B · INT8 encoder · Q5_K_M decoder
1.0 GB

The smallest tier. Runs well on integrated GPUs and older hardware.

How we compare

Feature Brethof Voice Pro Dragon Google STT Otter.ai Whisper (OSS)
100% local processing
Perpetual license ~
Native Linux support ~
Native Windows support ~ ~
36 language auto-detection ~
GPU acceleration (NVIDIA + AMD + Intel) N/A N/A ~
Personal model fine-tuning (coming Q3 2026)
Built-in noise reduction
Direct text injection
Polished desktop GUI
Typical cost $49 once $350+/yr $17/mo $17/mo Free

Pay once. Own forever.

No monthly fees. No usage limits. Perpetual license with 1 year of updates included.

Launch Promotion — Limited Time

Save 50% on Personal and 40% on Business licenses. Price locks in at purchase.

Get Launch Price
Free Trial
$0
14 days, all features unlocked

No credit card required. Just an email to verify your trial.

  • All 6 model tiers
  • GPU acceleration
  • 36 languages
  • Noise reduction
  • × No personal training (paid plans only, coming Q3 2026)
  • × 14-day limit
Start Free Trial
Business
$149/seat
Regular price: $249/seat
⚡ Launch promo — limited time

Per-seat perpetual license. Team & organization use. 1 year of updates.

  • Perpetual license (per seat)
  • Team & organization use
  • All 6 model tiers
  • Personal model training (coming Q3 2026)
  • Priority support
  • Volume discounts (10+ seats)
Buy Business License

Prices exclude tax. After the first year, updates are an optional $20/seat/year.

Frequently asked questions

Does my audio ever leave my computer?

No. Brethof Voice Pro processes everything locally on your device. No audio or text data ever leaves your computer. There is no cloud component, no telemetry, and no analytics.

What hardware do I need?

Any modern GPU works. NVIDIA GPUs use CUDA acceleration. AMD and Intel GPUs use Vulkan acceleration. You can also run on CPU only, though transcription will be slower. The Fast Light tier (1.0 GB) runs well even on integrated graphics.
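As a rough illustration of the answer above, the vendor-to-backend mapping can be sketched in a few lines of Python. This is not the app's actual detection code: the function name `pick_backend` and the vendor strings are hypothetical, and the sketch only restates the rule given here (CUDA for NVIDIA, Vulkan for AMD and Intel, CPU otherwise).

```python
def pick_backend(gpu_vendor):
    """Map a GPU vendor name to the backend described in the FAQ answer.

    Hypothetical helper for illustration only; the real app may detect
    hardware and choose a backend differently.
    """
    vendor = (gpu_vendor or "").strip().lower()
    if vendor == "nvidia":
        return "cuda"
    if vendor in ("amd", "intel"):
        return "vulkan"
    # No recognized GPU: fall back to CPU-only transcription (slower).
    return "cpu"


if __name__ == "__main__":
    for name in ("NVIDIA", "AMD", "Intel", None):
        print(name, "->", pick_backend(name))
```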

Which tier should I choose?

Start with Fast Balanced (1.2 GB): it is the default and offers excellent quality for its size. If you need maximum accuracy for professional work, try Max Full (3.2 GB). If you are on older hardware or want minimal resource usage, try Fast Light (1.0 GB). You can switch tiers at any time from the app settings.
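If you would rather pick by available memory, the idea can be sketched as follows. The tier names and download sizes are the ones listed on this page; the selection rule (largest tier whose download fits a budget) and the function name `pick_tier` are assumptions made for illustration. Download size is only a proxy for the memory a model actually needs at runtime.

```python
# Tier names and download sizes (GB) as listed on this page, largest first.
TIERS = [
    ("Max Full", 3.2),
    ("Max Balanced", 2.6),
    ("Max Light", 2.3),
    ("Fast Full", 1.5),
    ("Fast Balanced", 1.2),
    ("Fast Light", 1.0),
]


def pick_tier(budget_gb):
    """Return the largest tier whose download size fits the budget.

    Hypothetical rule for illustration; returns None if nothing fits.
    """
    for name, size_gb in TIERS:
        if size_gb <= budget_gb:
            return name
    return None


if __name__ == "__main__":
    for budget in (4.0, 2.5, 1.2, 0.5):
        print(budget, "GB ->", pick_tier(budget))
```

A real choice also depends on how fast you need results, so treat the output as a starting point and switch tiers in the app settings if it feels too slow or too inaccurate.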

Does it run on both Linux and Windows?

Yes. Brethof Voice Pro supports both Linux and Windows natively. On Linux it works with X11 and Wayland. On Windows it runs as a standard desktop application.

What happens after the first year of updates?

Your license is perpetual: the app keeps working forever with whatever version you have. The optional $20/year Update Pass gives you access to new features and model improvements. Without it, you simply stay on your current version.
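To make the arithmetic concrete, here is a small sketch comparing cumulative cost over several years. The figures come from this page: $49 once (the "Typical cost" row of the comparison table, assumed here to be a single license), one year of updates included, an optional $20/year Update Pass afterwards, and $17/month for the subscription services in the same table. The function names are hypothetical and taxes are ignored.

```python
def perpetual_cost(years, license_price=49, update_pass=20, keep_updating=True):
    """Total cost of a perpetual license over `years`.

    The first year of updates is included; each later year costs one
    Update Pass, but only if you choose to keep updating.
    """
    extra_years = max(0, years - 1) if keep_updating else 0
    return license_price + update_pass * extra_years


def subscription_cost(years, monthly_price=17):
    """Total cost of a monthly subscription over `years`."""
    return monthly_price * 12 * years


if __name__ == "__main__":
    for years in (1, 3, 5):
        print(
            years,
            "years:",
            perpetual_cost(years),
            "with updates,",
            perpetual_cost(years, keep_updating=False),
            "without, vs",
            subscription_cost(years),
            "subscription",
        )
```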

Can I train the model on my own voice?

Not in v1.0, but it is coming. Earlier builds offered training via PyTorch; when we moved the inference engine to GGUF, we had to disable it until the PyTorch-to-GGUF conversion pipeline is production-ready. Target: Q3 2026. It will be free for Personal and Business license holders when it ships, and all training will run locally on your machine. See the full roadmap post for details.

Ready to try it?

14-day free trial. No credit card. No cloud. No compromises.