fx
@@ -2,6 +2,7 @@
[](https://huggingface.co/microsoft/VibeVoice-ASR)
[](https://aka.ms/vibevoice-asr)

**VibeVoice-ASR** is the latest addition to the **VibeVoice** family. While the original VibeVoice / VibeVoice-Realtime focused on expressive TTS, **VibeVoice-ASR** focuses on understanding long-form speech with high precision and rich metadata.

It is a unified speech-to-text model designed to handle **1-hour long-form audio** in a single pass, generating structured transcriptions containing **Who (Speaker), When (Timestamps), and What (Content)**, with support for **User-Customized Context**.