From 96f8ac6a497dd4d674fa99bc7c6299ea7ae87ca2 Mon Sep 17 00:00:00 2001 From: YaoyaoChang Date: Thu, 22 Jan 2026 01:24:58 -0800 Subject: [PATCH] update README --- README.md | 16 ++++++++-------- 1 file changed, 8 insertions(+), 8 deletions(-) diff --git a/README.md b/README.md index 66786c7..962bf29 100644 --- a/README.md +++ b/README.md @@ -70,6 +70,12 @@ For more information, demos, and examples, please visit our [Project Page](https - **📝 Rich Transcription (Who, When, What)**: The model jointly performs ASR, diarization, and timestamping, producing a structured output that indicates *who* said *what* and *when*. +

+ DER
+ cpWER
+ tcpWER +

+ [📖 Documentation](docs/vibevoice-asr.md) | [🤗 Hugging Face](https://huggingface.co/microsoft/VibeVoice-ASR) | [🎮 Playground](https://aka.ms/vibevoice-asr) @@ -78,12 +84,6 @@ For more information, demos, and examples, please visit our [Project Page](https https://github.com/user-attachments/assets/acde5602-dc17-4314-9e3b-c630bc84aefa -

- DER
- cpWER
- tcpWER -

- ### 2. 🎙️ [VibeVoice-TTS](docs/vibevoice-tts.md) - Long-form Multi-speaker TTS @@ -102,11 +102,11 @@ https://github.com/user-attachments/assets/acde5602-dc17-4314-9e3b-c630bc84aefa Supports English, Chinese and other languages. -[📖 Documentation](docs/vibevoice-tts.md) | [🤗 Hugging Face](https://huggingface.co/microsoft/VibeVoice-1.5B) | [📊 Paper](https://arxiv.org/pdf/2508.19205) -
VibeVoice Results
+[📖 Documentation](docs/vibevoice-tts.md) | [🤗 Hugging Face](https://huggingface.co/microsoft/VibeVoice-1.5B) | [📊 Paper](https://arxiv.org/pdf/2508.19205) + **English**