Built Vakh using Rust + Tauri. Fully offline STT that injects typed text into any Windows app using native SendInput API. ⚡ Status: BETA (works great, occasionally hilarious) Tech stack: - whisper-rs for inference (tiny.en model) - webrtc-vad for voice detection - cpal for audio capture - SQLite for session logging What I learned building this: ❌ Don't forward audio to two threads separately (RAM nightmare) ✅ VAD gating before Whisper = fewer hallucinations ✅ Chunked processing with overlap = better accuracy ✅ Tauri is amazing but FEMA rules are tricky Download: https://arbharadwaj.github.io/Vakh/ Open source: https://github.com/arbharadwaj/Vakh Happy to discuss Tauri quirks, thread synchronization pain, FFI bindings, or why my first architecture was completely wrong (but taught me everything). Issues/PRs welcome. Judgment-free zone here. submitted by /u/QuarterOverall5966 [link] [comments]