WaveGAN-pytorch

PyTorch implementation of Synthesizing Audio with Generative Adversarial Networks (Chris Donahue et al., Feb 2018).

Before running, make sure you have the sc09 dataset and place it under your current project path.

Quick Start:

Installation

sudo apt-get install libav-tools

Download dataset

sc09: sc09 raw WAV files, utterances of the spoken English words '0'-'9'
piano: Piano raw WAV files

Run

For the sc09 task, make sure the sc09 dataset is under your current project path before running the code.

$ python train.py
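
Since training expects the dataset to be present under the project path, a quick check before launching can save a failed run. The following is only a sketch: the folder name `sc09` and the helper `count_wav_files` are assumptions for illustration and are not part of this repo, so adjust the path to wherever you unpacked the data.

```python
from pathlib import Path


def count_wav_files(dataset_dir):
    """Return the number of .wav files found recursively under dataset_dir.

    Raises FileNotFoundError if the directory does not exist.
    """
    root = Path(dataset_dir)
    if not root.is_dir():
        raise FileNotFoundError(f"Dataset directory not found: {root.resolve()}")
    # Accept both .wav and .WAV extensions.
    return sum(
        1 for p in root.rglob("*")
        if p.is_file() and p.suffix.lower() == ".wav"
    )


if __name__ == "__main__":
    # "sc09" is an assumed folder name; change it to match your download.
    try:
        n = count_wav_files("sc09")
        print(f"Found {n} WAV files under ./sc09")
    except FileNotFoundError as err:
        print(err)
```

If the count is zero or the directory is missing, fix the dataset location before running train.py.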

Training time

For the SC09 dataset, 4 x Tesla P40 take nearly 2 days to reach reasonable results.
For the piano dataset, 2 x Tesla P40 take 3-6 hours to reach reasonable results.
Increasing BATCH_SIZE from 10 to 32 or 64 shortens the per-epoch time on multiple GPUs, but each epoch then performs fewer gradient updates, so learning progresses more slowly per epoch.
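
The batch-size trade-off can be illustrated with simple arithmetic: a larger batch means fewer optimizer steps in each pass over the data. The sketch below assumes the last partial batch is kept, which may not match what train.py does, and `num_examples` is a made-up figure rather than the real size of either dataset.

```python
import math


def updates_per_epoch(num_examples, batch_size):
    """Number of optimizer steps in one pass over the data (last partial batch kept)."""
    return math.ceil(num_examples / batch_size)


# Hypothetical dataset size, chosen only to make the arithmetic easy to follow.
num_examples = 16000
for batch_size in (10, 32, 64):
    print(batch_size, updates_per_epoch(num_examples, batch_size))
# prints:
# 10 1600
# 32 500
# 64 250
```

Going from a batch size of 10 to 64 cuts the number of updates per epoch by more than a factor of six in this example, so more epochs may be needed to reach comparable results.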

Results

Generated "0-9": https://soundcloud.com/mazzzystar/sets/dcgan-sc09

Generated piano: https://soundcloud.com/mazzzystar/sets/wavegan-piano

Loss curve:

Architecture

TODO

Add some evaluation experiments, e.g. inception score.
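
For reference, the inception score is the exponential of the mean KL divergence between each sample's predicted class distribution p(y|x) and the marginal distribution p(y) over all samples. The sketch below is not part of this repo: it assumes you already have per-sample class probabilities from some pretrained classifier (for sc09, presumably one that predicts the ten digit classes), and the function name `inception_score` is chosen here for illustration.

```python
import math


def inception_score(probs):
    """Compute exp(mean_x KL(p(y|x) || p(y))).

    probs: list of per-sample class-probability lists, each summing to 1.
    """
    n = len(probs)
    num_classes = len(probs[0])
    # Marginal class distribution p(y), averaged over all samples.
    marginal = [sum(p[k] for p in probs) / n for k in range(num_classes)]
    # Mean KL divergence between each p(y|x) and p(y).
    mean_kl = 0.0
    for p in probs:
        # Terms with zero probability contribute nothing to the KL sum.
        mean_kl += sum(
            pk * math.log(pk / marginal[k])
            for k, pk in enumerate(p)
            if pk > 0
        )
    mean_kl /= n
    return math.exp(mean_kl)
```

The score is 1 when every sample gets the same prediction and approaches the number of classes when predictions are confident and evenly spread across classes.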

Contributions

This repo is based on chrisdonahue's and jtcramer's implementations.

