GitHub - microsoft/VibeVoice: Open-Source Frontier Voice AI

GitHub·microsoft·about 1 month ago

📰 News

2026-03-06: 🚀 VibeVoice ASR is now part of a Transformers release! You can now use our speech recognition model directly through the Hugging Face Transformers library for seamless integration into your projects.

2026-01-21: 📣 We open-sourced VibeVoice-ASR, a unified speech-to-text model designed to handle 60-minute long-form audio in a single pass, generating structured transcriptions containing Who (Speaker), When (Timestamps), and What (Content), with support for User-Customized Context. Try it in Playground.
⭐️ VibeVoice-ASR is natively multilingual, supporting over 50 languages; check the supported languages for details.
🔥 The VibeVoice-ASR finetuning code is now available!
⚡️ vLLM inference is now supported for faster inference; see vllm-asr for more details.
📑 VibeVoice-ASR Technique Report is available.…
