Update README with new TTS report and ICLR oral acceptance
Updated TTS report link and added conference acceptance note.
This commit is contained in:
@@ -3,7 +3,7 @@
|
|||||||
## 🎙️ VibeVoice: Open-Source Frontier Voice AI
|
## 🎙️ VibeVoice: Open-Source Frontier Voice AI
|
||||||
[](https://microsoft.github.io/VibeVoice)
|
[](https://microsoft.github.io/VibeVoice)
|
||||||
[](https://huggingface.co/collections/microsoft/vibevoice-68a2ef24a875c44be47b034f)
|
[](https://huggingface.co/collections/microsoft/vibevoice-68a2ef24a875c44be47b034f)
|
||||||
[](https://arxiv.org/pdf/2508.19205)
|
[](https://openreview.net/pdf?id=FihSkzyxdv)
|
||||||
[](https://arxiv.org/pdf/2601.18184)
|
[](https://arxiv.org/pdf/2601.18184)
|
||||||
[](https://colab.research.google.com/github/microsoft/VibeVoice/blob/main/demo/VibeVoice_colab.ipynb)
|
[](https://colab.research.google.com/github/microsoft/VibeVoice/blob/main/demo/VibeVoice_colab.ipynb)
|
||||||
[](https://aka.ms/vibevoice-asr)
|
[](https://aka.ms/vibevoice-asr)
|
||||||
@@ -44,7 +44,7 @@ https://github.com/user-attachments/assets/db0bb23f-ae06-4135-a66a-1ff1669f4f84
|
|||||||
2025-09-05: VibeVoice is an open-source research framework intended to advance collaboration in the speech synthesis community. After release, we discovered instances where the tool was used in ways inconsistent with the stated intent. Since responsible use of AI is one of Microsoft’s guiding principles, we have removed the VibeVoice-TTS code from this repository.
|
2025-09-05: VibeVoice is an open-source research framework intended to advance collaboration in the speech synthesis community. After release, we discovered instances where the tool was used in ways inconsistent with the stated intent. Since responsible use of AI is one of Microsoft’s guiding principles, we have removed the VibeVoice-TTS code from this repository.
|
||||||
|
|
||||||
|
|
||||||
2025-08-25: 📣 We open-sourced <a href="docs/vibevoice-tts.md"><strong>VibeVoice-TTS</strong></a>, a long-form multi-speaker text-to-speech model that can synthesize speech up to 90 minutes long with up to 4 distinct speakers.
|
2025-08-25: 📣 We open-sourced <a href="docs/vibevoice-tts.md"><strong>VibeVoice-TTS</strong></a>, a long-form multi-speaker text-to-speech model that can synthesize speech up to 90 minutes long with up to 4 distinct speakers. — accepted as an [Oral](https://openreview.net/forum?id=FihSkzyxdv) at ICLR 2026! 🔥
|
||||||
|
|
||||||
</div>
|
</div>
|
||||||
|
|
||||||
|
|||||||
Reference in New Issue
Block a user