Real-time dictation
Global hotkey capture with sub-100ms partials on capable hardware. Text injects into the focused field or copies to clipboard.
DESKTOP APP · v1.0.1
Local AI transcription and AI note taking powered by OpenAI Whisper Large — dictation, meetings, and batch exports without sending audio to the cloud.
Built on whisper.cpp · Model file ggml-large-v3.bin · From $10/mo
Hold your hotkey, speak, and watch text land in Slack, Word, VS Code, or any focused window. VocalFuse runs inference on your GPU or CPU — no round-trip to a cloud API, no per-minute billing surprises.
Always-on-top pill · drag grip to reposition
Pixel-accurate replicas of the Windows client from VocalFuse_cpp — 248×38 pill overlay, six capture states, settings dialog, and system tray menu.
Border #c4a04a · “Hold to speak”
Note switch on · “Click for notes”
Border #e2c06a · 16-bar visualizer
Border #9a7a34 · “Working...”
Pro batch capture · live meters
SMTP summary dispatch · “Sending notes...”
460px panel with Product License, Note Taking Mode (email SMTP), and Updates — matching settings.cpp.
Vocal Fuse
Required to use Vocal Fuse.
Product key verified.
Record continuously in 3-minute batches until stopped.
Sent via SMTP when a note session ends.
Compare your installed version with the latest release.
You are up to date.
Right-side controls: 36×16 note switch, settings gear, close button. Hover states shown below.
Tray context menu: Settings, Minimize, Show, Close — from tray.cpp.
#451218, drag grip 18px, logo 18px, Segoe UI labels, note switch 36×16. See the full wireframes in VocalFuse docs.
Subscribe on the web, install the desktop client, activate with your product key, and dictate anywhere.
Pick Basic ($10) or Pro ($15) on the pricing page. Stripe handles billing — cancel anytime.
Grab the installer from Downloads. First launch pulls ggml-large-v3.bin if needed.
Paste your product key, set your hotkey, enable GPU acceleration, and position the pill overlay where you want it.
Real-time speech to text into any app. Pro users batch-record meetings and export structured notes.
Global hotkey capture with sub-100ms partials on capable hardware. Text injects into the focused field or copies to clipboard.
Full Large weights — not a distilled cloud API model. Studio-grade accuracy for accents, jargon, and long-form narration.
Drop podcasts, interviews, and lectures. Export subtitles and documents without uploading audio anywhere.
Capture up to 3 minutes per batch, structure meeting notes locally, and email summaries via SMTP.
Microphone audio never leaves your machine. License checks use HTTPS; voice data does not.
License verification API, architecture docs, and local AI guides for teams embedding Fuse intelligence.
| Feature | Basic — $10/mo | Pro — $15/mo |
|---|---|---|
| OpenAI Whisper Large local dictation | ✓ | ✓ |
| Always-on-top pill widget | ✓ | ✓ |
| Hold-to-record hotkey | ✓ | ✓ |
| Batch file transcription (TXT, SRT, VTT, DOCX) | ✓ | ✓ |
| AI note taker mode | — | ✓ |
| 3-minute batch meeting capture | — | ✓ |
| Email summaries via SMTP | — | ✓ |
| Priority updates | — | ✓ |
Dictate commits, docs, and emails without leaving the IDE. Whisper Large handles technical vocabulary better than lightweight cloud models.
Keep sensitive conversations on-device. No third-party transcription bot joins your calls — audio stays in your trust boundary.
Transcribe lectures and podcasts offline. Pro turns long sessions into structured notes you can export or email.
| Operating system | Windows 10/11 (64-bit), macOS 12+, Ubuntu 22.04+ |
|---|---|
| Memory | 16 GB RAM recommended for OpenAI Whisper Large |
| GPU | Optional NVIDIA GPU with CUDA for lowest latency |
| Storage | ~3 GB for ggml-large-v3.bin |
| Model path | C:\VocalFuse\models\ |
| Permissions | Microphone; optional accessibility APIs for text injection |
| Account | Active VocalFuse subscription for downloads and license validation |
Create your account, subscribe, download VocalFuse.exe, and press your hotkey. No cloud account required beyond Fuse Intelligence.
VocalFuse uses OpenAI Whisper Large (ggml-large-v3) running locally through whisper.cpp — not a small cloud API or distilled model.
VocalFuse runs on Windows 10+, macOS 12+, and Ubuntu 22.04+. NVIDIA GPU acceleration is optional via CUDA.