From 0e1a0d39fd06f01ac6bb22c2201a1899557e0109 Mon Sep 17 00:00:00 2001
From: YaoyaoChang
Date: Tue, 27 Jan 2026 20:59:25 +0800
Subject: [PATCH] update README

---
 docs/vibevoice-asr.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/docs/vibevoice-asr.md b/docs/vibevoice-asr.md
index 3ec7581..21ef48a 100644
--- a/docs/vibevoice-asr.md
+++ b/docs/vibevoice-asr.md
@@ -9,7 +9,7 @@
 **Demo:** [VibeVoice-ASR-Demo](https://aka.ms/vibevoice-asr)
 **Report:** [VibeVoice-ASR-Report](https://arxiv.org/pdf/2601.18184)
 **Finetuning:** [finetune-guide](../finetuning-asr/README.md)
-**vllm:** [vllm-asr](./vibevoice-vllm-asr.md)
+**vLLM:** [vLLM-asr](./vibevoice-vllm-asr.md)
 
 ## 🔥 Key Features
 
@@ -24,7 +24,7 @@
  The model jointly performs ASR, diarization, and timestamping, producing a structured output that indicates *who* said *what* and *when*.
 
 - **🌍 Multilingual & Code-Switching Support**:
- It supports over 50 languages, requires no explicit language setting, and natively handles code-switching within and across utterances. Language distribution can be found [here](#language-distribution)
+ It supports over 50 languages, requires no explicit language setting, and natively handles code-switching within and across utterances. Language distribution can be found [here](#language-distribution).
 
 ## 🏗️ Model Architecture
 