mHubert codeHiFiGAN is not multispeaker #46

ehosseiniasl · 2024-09-02T00:50:55Z

hello,

the provided vocoder checkpoint using mHubert does not support multi-speaker. Do you have a multi-speaker checkpoint?

mhubert_vp_en_es_fr_it3_400k_layer11_km1000_lj

The text was updated successfully, but these errors were encountered:

ehosseiniasl · 2024-09-02T01:31:17Z

@0nutation in the paper, you have mentioned that you have trained a multi-speaker vocoder. could you please share the checkpoint?

Unit Vocoder Due to limition of single speaker unit vocoder in (Polyak et al., 2021), we train a
multi-speaker unit HiFi-GAN to decode the speech signal from the discrete representation. The
HiFi-GAN architecture consists of a generator G and multiple discriminators D. The generator uses
look-up tables (LUT) to embed discrete representations and the embedding sequences are up-sampled
by a series of blocks composed of transposed convolution and a residual block with dilated layers.
The speaker embedding is concatenated to each frame in the up-sampled sequence. The discriminator
features a Multi-Period Discriminator (MPD) and a Multi-Scale Discriminator (MSD), which have
the same architecture as (Polyak et al., 2021).

RobinWitch · 2024-09-19T05:37:40Z

@ehosseiniasl According to https://github.com/facebookresearch/fairseq/blob/920a548ca770fb1a951f7f4289b4d3a0c1bc226f/examples/speech_to_speech/docs/textless_s2st_real_data.md?plain=1#L16
Maybe we should retrain a multi-speaker voder using different datasets instead of LJSpeech

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mHubert codeHiFiGAN is not multispeaker #46

mHubert codeHiFiGAN is not multispeaker #46

ehosseiniasl commented Sep 2, 2024

ehosseiniasl commented Sep 2, 2024 •

edited

Loading

RobinWitch commented Sep 19, 2024

mHubert codeHiFiGAN is not multispeaker #46

mHubert codeHiFiGAN is not multispeaker #46

Comments

ehosseiniasl commented Sep 2, 2024

ehosseiniasl commented Sep 2, 2024 • edited Loading

RobinWitch commented Sep 19, 2024

ehosseiniasl commented Sep 2, 2024 •

edited

Loading