From e67b15f47da702f085fd817e88b5975ec0d1c9b9 Mon Sep 17 00:00:00 2001 From: YaoyaoChang Date: Sun, 25 Jan 2026 21:41:42 -0800 Subject: [PATCH] update --- README.md | 2 +- docs/vibevoice-asr.md | 5 ++++- 2 files changed, 5 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index fbf238e..ac0a765 100644 --- a/README.md +++ b/README.md @@ -75,7 +75,7 @@ For more information, demos, and examples, please visit our [Project Page](https - **📝 Rich Transcription (Who, When, What)**: The model jointly performs ASR, diarization, and timestamping, producing a structured output that indicates *who* said *what* and *when*. -[📖 Documentation](docs/vibevoice-asr.md) | [🤗 Hugging Face](https://huggingface.co/microsoft/VibeVoice-ASR) | [🎮 Playground](https://aka.ms/vibevoice-asr) | [🛠️ Finetuning](finetuning-asr/README.md) +[📖 Documentation](docs/vibevoice-asr.md) | [🤗 Hugging Face](https://huggingface.co/microsoft/VibeVoice-ASR) | [🎮 Playground](https://aka.ms/vibevoice-asr) | [🛠️ Finetuning](finetuning-asr/README.md) | [📊 Paper](docs/VibeVoice-ASR-Report.pdf)

diff --git a/docs/vibevoice-asr.md b/docs/vibevoice-asr.md index 76c777c..bfc4a7f 100644 --- a/docs/vibevoice-asr.md +++ b/docs/vibevoice-asr.md @@ -7,7 +7,10 @@ **Model:** [VibeVoice-ASR-7B](https://huggingface.co/microsoft/VibeVoice-ASR)
**Demo:** [VibeVoice-ASR-Demo](https://aka.ms/vibevoice-asr)
-**Finetuning:** [finetune-guide](../finetuning-asr/README.md) +**Report:** [VibeVoice-ASR-Report](./VibeVoice-ASR-Report.pdf)
+**Finetuning:** [finetune-guide](../finetuning-asr/README.md)
+**vllm:** [vllm-asr](./vibevoice-vllm-asr.md)
+ ## 🔥 Key Features