Commit Graph

74 Commits

Author SHA1 Message Date
MLSDCherryPick c03a707fd6 add video demo 2026-01-21 19:43:50 +00:00
YaoyaoChang a3750c229b Revise VibeVoice-ASR documentation for clarity
Updated the description and key features of VibeVoice-ASR to clarify its capabilities and improve accuracy in transcription.
2026-01-22 02:59:10 +08:00
YaoyaoChang c4352fee63 fx 2026-01-21 10:36:27 -08:00
YaoyaoChang 616a167275 add ASR playground link 2026-01-21 10:26:17 -08:00
YaoyaoChang f7c6d2dec9 update asr eval results 2026-01-21 09:50:24 -08:00
YaoyaoChang c9c778cc58 fx 2026-01-21 08:25:53 -08:00
Zhiliang Peng 56cb11e7b2 Add VibeVoice-ASR 2026-01-21 22:18:33 +08:00
YaoyaoChang 6c7369bb31 fix 2025-12-16 17:12:12 -08:00
YaoyaoChang 4adbe76674 more experimental voices 2025-12-16 04:21:09 -08:00
Wenhui Wang d295d1e1d0 Update vibevoice-realtime-0.5b.md 2025-12-09 12:28:32 +08:00
YaoyaoChang eb09b39cc3 fix 2025-12-08 20:20:11 -08:00
RaihanulHaque 9b06438560 feat: add __init__.py files to enable module imports
Add __init__.py files to vibevoice/modular and vibevoice/processor
directories to properly export classes and enable package imports.

This allows users to import the package after installation:
- from vibevoice import VibeVoiceStreamingForConditionalGenerationInference
- from vibevoice.modular import VibeVoiceStreamingConfig
- from vibevoice.processor import VibeVoiceStreamingProcessor

Fixes import errors when using `pip install -e .`
2025-12-09 10:48:11 +08:00
YaoyaoChang c1c5e40bef add star history 2025-12-08 18:41:51 -08:00
Wenhui Wang 73a9711d8e Update vibevoice_tokenizer_processor.py 2025-12-09 10:16:55 +08:00
YaoyaoChang 04d19f8352 add experimental multi-lingual speakers 2025-12-08 08:29:00 -08:00
hydropix 79470ff576 Fix: Remove unnecessary Path() conversion for HuggingFace model IDs
The model_path was being converted to a Path object and then back to string
for from_pretrained() calls. This is unnecessary since HuggingFace accepts
strings directly, and causes issues on Windows where Path() converts forward
slashes to backslashes (e.g., "microsoft/VibeVoice-Realtime-0.5B" becomes
"microsoft\VibeVoice-Realtime-0.5B").

This fix:
- Keeps model_path as a string (no behavior change on Linux/macOS)
- Fixes Windows compatibility for HuggingFace repo IDs
- Removes redundant str() conversions
2025-12-08 10:27:58 +08:00
Wenhui Wang a507d67f8e Update README 2025-12-05 21:49:07 +08:00
Wenhui Wang 042656cb4b Update README 2025-12-05 21:47:22 +08:00
YaoyaoChang 7ea24a4fb9 update 2025-12-04 22:33:57 -08:00
YaoyaoChang adc08b1575 update title 2025-12-04 17:39:48 -08:00
YaoyaoChang 82d5f29842 Fix: Colab downloads occasionally get stuck 2025-12-04 07:20:36 -08:00
YaoyaoChang fc83be5d92 add VibeVoice-Realtime 2025-12-04 05:38:30 -08:00
YaoyaoChang e81395cf6d News: VibeVoice-Realtime-0.5B 2025-12-03 21:06:19 -08:00
YaoyaoChang 8107244b0b statement 2025-09-05 08:02:28 -07:00