A generative speech model for daily dialogue.
-
Updated
Oct 21, 2024 - Python
A generative speech model for daily dialogue.
⭐ 本科毕业设计:基于内容的音乐推荐系统设计与开发。使用了Pytorch框架构建训练模型代码,使用Django构建了前后端。
VoxNovel: generate audiobooks giving each character a different voice actor.
Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
TTS models for Arabic (Tacotron2, FastPitch)
Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
KAE : KAN-based AutoEncoder (AE, VAE, VQ-VAE, RVQ, etc.)
Cross-compilation of PyTorch armv7l (32bit) for RaspberryPi OS
Sound classification on Urban Sound Dataset
Speech command classification on Speech-Command v0.02 dataset using PyTorch and torchaudio. In this example, three models have been trained using the raw signal waveforms, MFCC features and MelSpectogram features.
A utility for wrapping the Free Spoken Digit Dataset into PyTorch-ready data set splits.
High fidelity music synthesis using diffusion and UnivNet.
(😞 😨 😄 😮 😍 😠 😐 🤮) This is a simple DL API that classifies human emotions from audios and text.
TTS (FastPitch) for German
🤖 Telegram bot powered by Deep Learning. Automatically assesses the safety of audios and voice messages for people suffering from misophonia.
Experiments in neural networks for audio generation.
Speech to Text with Wav2Vec2 using torchaudio
this is a simple artificial neural network model using deep learning and torch-audio to classify cats and dog sounds.
Add a description, image, and links to the torchaudio topic page so that developers can more easily learn about it.
To associate your repository with the torchaudio topic, visit your repo's landing page and select "manage topics."