From 7e73beec97c1e2cf085499fb319e128d78304e29 Mon Sep 17 00:00:00 2001
From: Yan Xia
Date: Fri, 6 Mar 2026 13:32:21 +0800
Subject: [PATCH] Add news about VibeVoice ASR Transformers integration

- Added announcement that VibeVoice ASR is now part of Transformers v5.3.0 release
- Linked to the official Hugging Face Transformers release page
- Positioned as the latest news item with today's date

Co-Authored-By: Claude Sonnet 4.6
---
 README.md | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index e5abe2a..3e12545 100644
--- a/README.md
+++ b/README.md
@@ -22,7 +22,9 @@

📰 News

-2026-01-21: 📣 We open-sourced VibeVoice-ASR, a unified speech-to-text model designed to handle 60-minute long-form audio in a single pass, generating structured transcriptions containing Who (Speaker), When (Timestamps), and What (Content), with support for User-Customized Context. Try it in [Playground](https://aka.ms/vibevoice-asr).
+2026-03-06: 🚀 VibeVoice ASR is now part of a Transformers release! You can now use our speech recognition model directly through the Hugging Face Transformers library for seamless integration into your projects.
+
+2026-01-21: 📣 We open-sourced VibeVoice-ASR, a unified speech-to-text model designed to handle 60-minute long-form audio in a single pass, generating structured transcriptions containing Who (Speaker), When (Timestamps), and What (Content), with support for User-Customized Context. Try it in [Playground](https://aka.ms/vibevoice-asr).
 - ⭐️ VibeVoice-ASR is natively multilingual, supporting over 50 languages — check the [supported languages](docs/vibevoice-asr.md#language-distribution) for details.
 - 🔥 The VibeVoice-ASR [finetuning code](finetuning-asr/README.md) is now available!
 - ⚡️ **vLLM inference** is now supported for faster inference; see [vllm-asr](docs/vibevoice-vllm-asr.md) for more details.