Jianwei Yu
3817f74d46
feat: nginx-based data parallel for optimal ASR throughput
...
When --dp N is specified (N > 1), the launcher now starts N independent
vLLM processes behind an nginx reverse proxy instead of using vLLM's
built-in DP coordinator. This avoids the single-process HTTP bottleneck
when handling large base64 audio payloads, achieving near-linear scaling
(7.2x with 8 GPUs at 4096 concurrent requests).
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com >
2026-03-27 07:43:32 +00:00
JianweiYu
9634518ca4
Add data parallel (DP) support to vLLM server launcher
...
- Add --dp/--data-parallel-size flag for running independent model replicas
across multiple GPUs with automatic load balancing behind a single port
- Add --tp/--tensor-parallel-size flag (previously hardcoded to 1)
- Update docs/vibevoice-vllm-asr.md with multi-GPU deployment guide
covering DP, TP, and hybrid (DP × TP) configurations
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com >
2026-03-24 11:53:31 +00:00
Damon-Salvetore
165e17e5ed
fix: vllm-version-stable
2026-02-25 07:30:43 +00:00
YingboHAO
bb54f78d0e
feat: add hotwords support for vLLM ASR
2026-02-04 10:33:20 +00:00
YaoyaoChang
e43c1e2cdb
streaming use transformers==4.51.3
2026-02-03 00:30:52 -08:00
Jianwei Yu
e16491d65e
Merge pull request #228 from Damon-Salvetore/vllm-1
...
[Fix] Resolve occasional infinite loops during vLLM inference
2026-02-03 10:38:40 +08:00
YingboHAO
e26f1c263f
1
2026-02-02 13:50:27 +00:00
Zhiliang Peng
b2aee8015c
Delete docs/VibeVoice-ASR-Report.pdf
2026-01-28 19:33:37 +08:00
YaoyaoChang
3140709188
update README
2026-01-27 21:06:31 +08:00
YaoyaoChang
c435ae05d5
update README
...
Added a section on LoRA fine-tuning to the ASR documentation.
2026-01-27 21:01:40 +08:00
YaoyaoChang
0e1a0d39fd
update README
2026-01-27 20:59:25 +08:00
YaoyaoChang
142a00112e
update ASR README: multilingual
2026-01-27 20:58:10 +08:00
YaoyaoChang
4648c50ea0
update ASR Technical Report link to Arxiv
2026-01-27 12:58:06 +08:00
YaoyaoChang
a69e77c036
1. unify env for TTS and ASR; 2. avoid transformers 5.0.0 temporarily
2026-01-26 03:29:02 -08:00
YingboHAO
1eb04f53a2
Replace install_deps.sh with start_server.py one-click deployment
2026-01-26 07:34:54 +00:00
YaoyaoChang
e67b15f47d
update
2026-01-25 21:41:42 -08:00
MLSDCherryPick
d9068541cf
1
2026-01-25 16:11:02 +00:00
Jianwei Yu
3c50e50d18
Merge pull request #203 from Damon-Salvetore/vibevoice-vllm
...
Add vLLM plugin support for high-performance ASR serving
2026-01-24 16:17:10 +08:00
MLSDCherryPick
7d12252de3
Language support
2026-01-24 05:11:34 +00:00
MLSDCherryPick
a3e99daedd
Language support
2026-01-24 05:10:47 +00:00
YingboHAO
4df5b0582f
Add vLLM plugin support for high-performance ASR serving
2026-01-23 17:32:24 +00:00
YaoyaoChang
c0c2af984e
update README for finetuning-asr
2026-01-22 06:20:11 -08:00
YaoyaoChang
5022277022
update README
2026-01-22 00:51:00 -08:00
YaoyaoChang
32a7040ce0
restructure README
2026-01-22 00:37:22 -08:00
YaoyaoChang
ce90a49960
fix env bug
2026-01-21 22:03:52 -08:00
YaoyaoChang
a3750c229b
Revise VibeVoice-ASR documentation for clarity
...
Updated the description and key features of VibeVoice-ASR to clarify its capabilities and improve accuracy in transcription.
2026-01-22 02:59:10 +08:00
YaoyaoChang
c4352fee63
fx
2026-01-21 10:36:27 -08:00
YaoyaoChang
616a167275
add ASR playground link
2026-01-21 10:26:17 -08:00
YaoyaoChang
f7c6d2dec9
update asr eval results
2026-01-21 09:50:24 -08:00
YaoyaoChang
c9c778cc58
fx
2026-01-21 08:25:53 -08:00
Zhiliang Peng
56cb11e7b2
Add VibeVoice-ASR
2026-01-21 22:18:33 +08:00
YaoyaoChang
4adbe76674
more experimental voices
2025-12-16 04:21:09 -08:00
Wenhui Wang
d295d1e1d0
Update vibevoice-realtime-0.5b.md
2025-12-09 12:28:32 +08:00
YaoyaoChang
7ea24a4fb9
update
2025-12-04 22:33:57 -08:00
YaoyaoChang
fc83be5d92
add VibeVoice-Realtime
2025-12-04 05:38:30 -08:00