MLSDCherryPick
71356b87dd
Language support
2026-01-24 05:17:26 +00:00
MLSDCherryPick
7d12252de3
Language support
2026-01-24 05:11:34 +00:00
MLSDCherryPick
a3e99daedd
Language support
2026-01-24 05:10:47 +00:00
YingboHAO
04f8bc40b0
Update test_api.py
2026-01-23 17:47:31 +00:00
YingboHAO
4df5b0582f
Add vLLM plugin support for high-performance ASR serving
2026-01-23 17:32:24 +00:00
YaoyaoChang
c0c2af984e
update README for finetuning-asr
2026-01-22 06:20:11 -08:00
Zhiliang Peng
05e1a022e5
Update FT README
...
Clarified the purpose of the toy dataset in the README.
2026-01-22 21:49:47 +08:00
Zhiliang Peng
59c90e7633
Merge pull request #197 from pengzhiliang/vibevoice_asr_ft
...
add VibeVoice-ASR finetuning code
2026-01-22 21:45:35 +08:00
pengzhiliang
8516386ce4
update ft readme
2026-01-22 05:44:34 -08:00
pengzhiliang
cef628e1b5
update ft code
2026-01-22 05:20:25 -08:00
pengzhiliang
db2f1d9ff3
init vibevoice-asr ft
2026-01-22 05:04:33 -08:00
YaoyaoChang
875115c000
update README
2026-01-22 01:28:21 -08:00
YaoyaoChang
c0d7616e5a
update README
2026-01-22 01:26:44 -08:00
YaoyaoChang
0e0caf2f08
update README
2026-01-22 01:25:30 -08:00
YaoyaoChang
96f8ac6a49
update README
2026-01-22 01:24:58 -08:00
YaoyaoChang
0f8954a600
update README
2026-01-22 01:21:56 -08:00
YaoyaoChang
eb3533d791
update README
2026-01-22 00:51:33 -08:00
YaoyaoChang
5022277022
update README
2026-01-22 00:51:00 -08:00
YaoyaoChang
6c523ec087
update README
2026-01-22 00:49:58 -08:00
YaoyaoChang
883e3acc67
update README
2026-01-22 00:39:49 -08:00
YaoyaoChang
32a7040ce0
restructure README
2026-01-22 00:37:22 -08:00
YaoyaoChang
ce90a49960
fix env bug
2026-01-21 22:03:52 -08:00
MLSDCherryPick
1b6e8b56ea
asr evaluation
2026-01-22 03:44:34 +00:00
MLSDCherryPick
84e469c68e
asr evaluation
2026-01-22 03:43:31 +00:00
MLSDCherryPick
c03a707fd6
add video demo
2026-01-21 19:43:50 +00:00
YaoyaoChang
a3750c229b
Revise VibeVoice-ASR documentation for clarity
...
Updated the description and key features of VibeVoice-ASR to clarify its capabilities and improve accuracy in transcription.
2026-01-22 02:59:10 +08:00
YaoyaoChang
c4352fee63
fx
2026-01-21 10:36:27 -08:00
YaoyaoChang
616a167275
add ASR playground link
2026-01-21 10:26:17 -08:00
YaoyaoChang
f7c6d2dec9
update asr eval results
2026-01-21 09:50:24 -08:00
YaoyaoChang
c9c778cc58
fx
2026-01-21 08:25:53 -08:00
Zhiliang Peng
56cb11e7b2
Add VibeVoice-ASR
2026-01-21 22:18:33 +08:00
YaoyaoChang
6c7369bb31
fix
2025-12-16 17:12:12 -08:00
YaoyaoChang
4adbe76674
more experimental voices
2025-12-16 04:21:09 -08:00
Wenhui Wang
d295d1e1d0
Update vibevoice-realtime-0.5b.md
2025-12-09 12:28:32 +08:00
YaoyaoChang
eb09b39cc3
fix
2025-12-08 20:20:11 -08:00
RaihanulHaque
9b06438560
feat: add __init__.py files to enable module imports
...
Add __init__.py files to vibevoice/modular and vibevoice/processor
directories to properly export classes and enable package imports.
This allows users to import the package after installation:
- from vibevoice import VibeVoiceStreamingForConditionalGenerationInference
- from vibevoice.modular import VibeVoiceStreamingConfig
- from vibevoice.processor import VibeVoiceStreamingProcessor
Fixes import errors when using `pip install -e .`
2025-12-09 10:48:11 +08:00
YaoyaoChang
c1c5e40bef
add star history
2025-12-08 18:41:51 -08:00
Wenhui Wang
73a9711d8e
Update vibevoice_tokenizer_processor.py
2025-12-09 10:16:55 +08:00
YaoyaoChang
04d19f8352
add experimental multi-lingual speakers
2025-12-08 08:29:00 -08:00
hydropix
79470ff576
Fix: Remove unnecessary Path() conversion for HuggingFace model IDs
...
The model_path was being converted to a Path object and then back to string
for from_pretrained() calls. This is unnecessary since HuggingFace accepts
strings directly, and causes issues on Windows where Path() converts forward
slashes to backslashes (e.g., "microsoft/VibeVoice-Realtime-0.5B" becomes
"microsoft\VibeVoice-Realtime-0.5B").
This fix:
- Keeps model_path as a string (no behavior change on Linux/macOS)
- Fixes Windows compatibility for HuggingFace repo IDs
- Removes redundant str() conversions
2025-12-08 10:27:58 +08:00
Wenhui Wang
a507d67f8e
Update README
2025-12-05 21:49:07 +08:00
Wenhui Wang
042656cb4b
Update README
2025-12-05 21:47:22 +08:00
YaoyaoChang
7ea24a4fb9
update
2025-12-04 22:33:57 -08:00
YaoyaoChang
adc08b1575
update title
2025-12-04 17:39:48 -08:00
YaoyaoChang
82d5f29842
Fix: Colab downloads occasionally get stuck
2025-12-04 07:20:36 -08:00
YaoyaoChang
fc83be5d92
add VibeVoice-Realtime
2025-12-04 05:38:30 -08:00
YaoyaoChang
e81395cf6d
News: VibeVoice-Realtime-0.5B
2025-12-03 21:06:19 -08:00
YaoyaoChang
8107244b0b
statement
2025-09-05 08:02:28 -07:00