diff --git a/README.md b/README.md index feb8606..63760bb 100644 --- a/README.md +++ b/README.md @@ -24,6 +24,7 @@ 2026-01-21: 📣 We open-sourced VibeVoice-ASR, a unified speech-to-text model designed to handle 60-minute long-form audio in a single pass, generating structured transcriptions containing Who (Speaker), When (Timestamps), and What (Content), with support for User-Customized Context. Try it in [Playground](https://aka.ms/vibevoice-asr). - ⭐️ VibeVoice-ASR is natively multilingual, supporting over 100 languages — check the [supported languages](docs/vibevoice-asr.md#language-distribution) for details. - 🔥 The VibeVoice-ASR [finetuning code](finetuning-asr/README.md) is now available! +- ⚡️ **vLLM inference** is now supported for faster inference; see [vllm-asr](docs/vibevoice-vllm-asr.md) for more details. 2025-12-16: 📣 We added experimental speakers to VibeVoice‑Realtime‑0.5B for exploration, including multilingual voices in nine languages (DE, FR, IT, JP, KR, NL, PL, PT, ES) and 11 distinct English style voices. [Try it](docs/vibevoice-realtime-0.5b.md#optional-more-experimental-voices). More speaker types will be added over time.