Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

noise vector in MelGAN and learning an accurate conditioning #25

Open
acids-ircam opened this issue Jun 10, 2020 · 0 comments
Open

noise vector in MelGAN and learning an accurate conditioning #25

acids-ircam opened this issue Jun 10, 2020 · 0 comments

Comments

@acids-ircam
Copy link

Hello,

There is something confusing in the paper, although it's in overall greatly written.

2.1 Generator architecture, it states that additional noise is not fed in the generator since it does not affect the perceptual quality of the result

2.3 Training objective, eq (1) and (2) state G(s,z) meaning that Generator processes both spectrogram and gaussian noise vector

If you inject noise, where is that happening please ?

This brings me to an other question. I read GAN-TTS, they inject noise when conditioning hidden activations wrt. speaker embedding. And they use conditional discriminator to ensure that the audio is both realistic and accurately conditioned.

If there is neither noise in MelGAN, nor conditional discriminator, how do you assess that the generator is learning and generalizing for the conditional generation please ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

1 participant