speech-synthesis

Here are 1,191 public repositories matching this topic...

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation tts speech-synthesis neural-networks deeplearning speaker-recognition asr multimodal speech-translation large-language-models speaker-diariazation generative-ai

Updated Jul 5, 2024
Python

espeak-ng / espeak-ng

Star

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

android text-to-speech speech-synthesis espeak espeak-ng

Updated Jul 4, 2024
C

EveryVoiceTTS / EveryVoice

Star

The EveryVoice TTS Toolkit - Text To Speech for your language

python text-to-speech speech pytorch tts speech-synthesis speech-processing language-revitalization low-resource-languages pytorch-lightning

Updated Jul 4, 2024
Python

voicepaw / so-vits-svc-fork

Star

so-vits-svc fork with realtime support, improved interface and more features.

lightning deep-learning realtime pytorch speech-synthesis gan hacktoberfest voice-conversion voice-changer pytorch-lightning hubert vits sovits so-vits-svc softvc contentvec

Updated Jul 4, 2024
Python

ssb22 / CedPane

Star

Chinese-English Dictionary Public-domain Additions for Names Etc (CedPane)

dictionary speech-synthesis chinese-text-segmentation romanization cantonese-language mandarin-chinese

Updated Jul 4, 2024

leon-ai / leon

Star

🧠 Leon is your open-source personal assistant.

Updated Jul 4, 2024
TypeScript

KoljaB / RealtimeTTS

Star

Converts text to speech in realtime

python text-to-speech realtime speech-synthesis

Updated Jul 4, 2024
Python

DigitalPhonetics / IMS-Toucan

Star

Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.

text-to-speech deep-learning toolkit speech pytorch tts speech-synthesis speech-processing

Updated Jul 4, 2024
Python

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

text-to-speech audit speech-synthesis audio-synthesis music-generation voice-conversion text-to-audio fastspeech2 vits hifi-gan audio-generation singing-voice-conversion vall-e audioldm naturalspeech2

Updated Jul 4, 2024
Python

BakerBunker / FreeV

Star

[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

speech speech-synthesis vocoder interspeech

Updated Jul 4, 2024
Python

rany2 / edge-tts

Star

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

text-to-speech tts speech-synthesis

Updated Jul 3, 2024
Python

espnet / espnet

Star

End-to-End Speech Processing Toolkit

text-to-speech deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated Jul 3, 2024
Python

rohanmittal1163 / translator

Star

A user-friendly translator application that effortlessly converts text from one language to another. Its intuitive interface makes language translation quick and easy for everyone.

javascript css html translation speech-synthesis googletranslate navigator-api

Updated Jul 3, 2024
JavaScript

PaddlePaddle / PaddleSpeech

Star

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.