A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Updated Jul 5, 2024 - Python
eSpeak NG is an open-source speech synthesizer that supports more than a hundred languages and accents.
The EveryVoice TTS Toolkit - Text To Speech for your language
so-vits-svc fork with realtime support, improved interface and more features.
Chinese-English Dictionary Public-domain Additions for Names Etc (CedPane)
🧠 Leon is your open-source personal assistant.
Converts text to speech in realtime
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
End-to-End Speech Processing Toolkit
A user-friendly translator application that effortlessly converts text from one language to another. Its intuitive interface makes language translation quick and easy for everyone.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Open singing synthesis platform / Open source UTAU successor
🔊 Cross-browser speech synthesis, also known as text-to-speech (TTS); no dependencies; uses the Web Speech API
Lingvo
MARS5 speech model (TTS) from CAMB.AI
A high-quality speech analysis, manipulation and synthesis system
A Rust crate for easily implementing text-to-speech in your Rust programs.