36 Commits

Author SHA1 Message Date
Jianwei Yu cd945395d4 feat: set nginx workers to 2×dp for optimal HTTP throughput
Nginx worker_processes now defaults to 2×N (where N is the number of DP
replicas) instead of 'auto'. This ensures enough HTTP handler processes
to fully saturate all GPU backends under heavy concurrent load.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-03-27 09:16:05 +00:00
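
The sizing rule described in the commit above can be sketched as follows. This is a minimal illustration, not the launcher's actual code: the function name `nginx_worker_processes` and the input validation are assumptions, and only the 2×N relationship comes from the commit message.

```python
def nginx_worker_processes(dp_replicas: int) -> int:
    """Return the nginx worker count for N data-parallel replicas.

    Hypothetical helper: the commit message states only that the default
    is 2 x N instead of 'auto'; everything else here is illustrative.
    """
    if dp_replicas < 1:
        raise ValueError("dp_replicas must be >= 1")
    return 2 * dp_replicas


def worker_processes_directive(dp_replicas: int) -> str:
    """Format the value as an nginx `worker_processes` directive line."""
    return f"worker_processes {nginx_worker_processes(dp_replicas)};"
```

For example, 8 replicas would give `worker_processes 16;` under this rule.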
Jianwei Yu 3817f74d46 feat: nginx-based data parallel for optimal ASR throughput
When --dp N is specified (N > 1), the launcher now starts N independent
vLLM processes behind an nginx reverse proxy instead of using vLLM's
built-in DP coordinator. This avoids the single-process HTTP bottleneck
when handling large base64 audio payloads, achieving near-linear scaling
(7.2x with 8 GPUs at 4096 concurrent requests).

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-03-27 07:43:32 +00:00
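
As a rough sketch of the layout this commit describes (N independent backends behind one nginx reverse proxy), a launcher might render a config along these lines. The function name, port numbers, upstream name `vllm_backends`, body size limit, and the choice of `least_conn` balancing are all assumptions made for illustration; the commit message does not show the generated config.

```python
def render_nginx_conf(dp: int, listen_port: int = 8000,
                      backend_base_port: int = 8001) -> str:
    """Render a minimal nginx config proxying one port to `dp` local backends.

    Hypothetical sketch: ports, names, and limits are illustrative only.
    """
    if dp < 2:
        raise ValueError("the nginx front end is only used when dp > 1")
    # One upstream entry per independent vLLM process.
    servers = "\n".join(
        f"        server 127.0.0.1:{backend_base_port + i};" for i in range(dp)
    )
    return f"""worker_processes auto;
events {{}}
http {{
    # Base64 audio bodies can be large; this limit is an assumed value.
    client_max_body_size 512m;
    upstream vllm_backends {{
        least_conn;
{servers}
    }}
    server {{
        listen {listen_port};
        location / {{
            proxy_pass http://vllm_backends;
        }}
    }}
}}
"""
```

Each backend accepts HTTP requests in its own process, which is the property the commit message credits for avoiding the single-process bottleneck.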
JianweiYu 9634518ca4 Add data parallel (DP) support to vLLM server launcher
- Add --dp/--data-parallel-size flag for running independent model replicas
  across multiple GPUs with automatic load balancing behind a single port
- Add --tp/--tensor-parallel-size flag (previously hardcoded to 1)
- Update docs/vibevoice-vllm-asr.md with multi-GPU deployment guide
  covering DP, TP, and hybrid (DP × TP) configurations

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-03-24 11:53:31 +00:00
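
The two flags named in this commit could be declared roughly as below. Only the flag names and the previous hardcoded tensor-parallel size of 1 come from the commit message; the `dest` names, the `--dp` default of 1, and the `gpus_required` helper are assumptions for illustration.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build an illustrative parser exposing the DP and TP flags."""
    parser = argparse.ArgumentParser(description="vLLM server launcher (sketch)")
    parser.add_argument(
        "--dp", "--data-parallel-size", dest="dp", type=int, default=1,
        help="number of independent model replicas (default is an assumption)",
    )
    parser.add_argument(
        "--tp", "--tensor-parallel-size", dest="tp", type=int, default=1,
        help="GPUs per replica; 1 matches the previously hardcoded value",
    )
    return parser


def gpus_required(dp: int, tp: int) -> int:
    """A hybrid DP x TP deployment uses dp replicas of tp GPUs each."""
    return dp * tp
```

With `--dp 4 --tp 2`, this sketch would report 8 GPUs for the hybrid configuration.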
Damon-Salvetore 165e17e5ed fix: vllm-version-stable 2026-02-25 07:30:43 +00:00
YingboHAO bb54f78d0e feat: add hotwords support for vLLM ASR 2026-02-04 10:33:20 +00:00
YaoyaoChang e43c1e2cdb streaming use transformers==4.51.3 2026-02-03 00:30:52 -08:00
Jianwei Yu e16491d65e Merge pull request #228 from Damon-Salvetore/vllm-1
[Fix] Resolve occasional infinite loops during vLLM inference
2026-02-03 10:38:40 +08:00
YingboHAO e26f1c263f 1 2026-02-02 13:50:27 +00:00
Zhiliang Peng b2aee8015c Delete docs/VibeVoice-ASR-Report.pdf 2026-01-28 19:33:37 +08:00
YaoyaoChang 3140709188 update README 2026-01-27 21:06:31 +08:00
YaoyaoChang c435ae05d5 update README
Added a section on LoRA fine-tuning to the ASR documentation.
2026-01-27 21:01:40 +08:00
YaoyaoChang 0e1a0d39fd update README 2026-01-27 20:59:25 +08:00
YaoyaoChang 142a00112e update ASR README: multilingual 2026-01-27 20:58:10 +08:00
YaoyaoChang 4648c50ea0 update ASR Technical Report link to Arxiv 2026-01-27 12:58:06 +08:00
YaoyaoChang a69e77c036 1. unify env for TTS and ASR; 2. avoid transformers 5.0.0 temporarily 2026-01-26 03:29:02 -08:00
YingboHAO 1eb04f53a2 Replace install_deps.sh with start_server.py one-click deployment 2026-01-26 07:34:54 +00:00
YaoyaoChang e67b15f47d update 2026-01-25 21:41:42 -08:00
MLSDCherryPick d9068541cf 1 2026-01-25 16:11:02 +00:00
Jianwei Yu 3c50e50d18 Merge pull request #203 from Damon-Salvetore/vibevoice-vllm
Add vLLM plugin support for high-performance ASR serving
2026-01-24 16:17:10 +08:00
MLSDCherryPick 7d12252de3 Language support 2026-01-24 05:11:34 +00:00
MLSDCherryPick a3e99daedd Language support 2026-01-24 05:10:47 +00:00
YingboHAO 4df5b0582f Add vLLM plugin support for high-performance ASR serving 2026-01-23 17:32:24 +00:00
YaoyaoChang c0c2af984e update README for finetuning-asr 2026-01-22 06:20:11 -08:00
YaoyaoChang 5022277022 update README 2026-01-22 00:51:00 -08:00
YaoyaoChang 32a7040ce0 restructure README 2026-01-22 00:37:22 -08:00
YaoyaoChang ce90a49960 fix env bug 2026-01-21 22:03:52 -08:00
YaoyaoChang a3750c229b Revise VibeVoice-ASR documentation for clarity
Updated the description and key features of VibeVoice-ASR to clarify its capabilities and improve accuracy in transcription.
2026-01-22 02:59:10 +08:00
YaoyaoChang c4352fee63 fx 2026-01-21 10:36:27 -08:00
YaoyaoChang 616a167275 add ASR playground link 2026-01-21 10:26:17 -08:00
YaoyaoChang f7c6d2dec9 update asr eval results 2026-01-21 09:50:24 -08:00
YaoyaoChang c9c778cc58 fx 2026-01-21 08:25:53 -08:00
Zhiliang Peng 56cb11e7b2 Add VibeVoice-ASR 2026-01-21 22:18:33 +08:00
YaoyaoChang 4adbe76674 more experimental voices 2025-12-16 04:21:09 -08:00
Wenhui Wang d295d1e1d0 Update vibevoice-realtime-0.5b.md 2025-12-09 12:28:32 +08:00
YaoyaoChang 7ea24a4fb9 update 2025-12-04 22:33:57 -08:00
YaoyaoChang fc83be5d92 add VibeVoice-Realtime 2025-12-04 05:38:30 -08:00