update

2026-01-25 21:41:42 -08:00
parent d9068541cf
commit e67b15f47d
2 changed files with 5 additions and 2 deletions
@@ -75,7 +75,7 @@ For more information, demos, and examples, please visit our [Project Page](https
 - **📝 Rich Transcription (Who, When, What)**:
  The model jointly performs ASR, diarization, and timestamping, producing a structured output that indicates *who* said *what* and *when*.

-[📖 Documentation](docs/vibevoice-asr.md) | [🤗 Hugging Face](https://huggingface.co/microsoft/VibeVoice-ASR) | [🎮 Playground](https://aka.ms/vibevoice-asr) | [🛠️ Finetuning](finetuning-asr/README.md)
+[📖 Documentation](docs/vibevoice-asr.md) | [🤗 Hugging Face](https://huggingface.co/microsoft/VibeVoice-ASR) | [🎮 Playground](https://aka.ms/vibevoice-asr) | [🛠️ Finetuning](finetuning-asr/README.md) |  [📊 Paper](docs/VibeVoice-ASR-Report.pdf)


 <p align="center">
@@ -7,7 +7,10 @@

 **Model:** [VibeVoice-ASR-7B](https://huggingface.co/microsoft/VibeVoice-ASR)<br>
 **Demo:** [VibeVoice-ASR-Demo](https://aka.ms/vibevoice-asr)<br>
-**Finetuning:** [finetune-guide](../finetuning-asr/README.md)
+**Report:** [VibeVoice-ASR-Report](./VibeVoice-ASR-Report.pdf)<br>
+**Finetuning:** [finetune-guide](../finetuning-asr/README.md)<br>
+**vllm:** [vllm-asr](./vibevoice-vllm-asr.md)<br>
+

 ## 🔥 Key Features