About multispeaker multilingual model #132

Marioando · 2025-01-06T10:12:21Z

Hi, I'm training a multispeaker multilingual model, but speaker and language are deeply entangled. I added a language embedding that follows the same path as the speaker embedding (into the text encoder and the CFM decoder). Could you give some suggestions for achieving code-switching capabilities? Thank you!

shivammehta25 · 2025-01-08T06:21:43Z

Hello,
Disentanglement is a research problem in itself. Are you adding the language tag per token or per sentence? I would suggest adding it per token. If you can get alignments, interleave audio from different languages so that the model sees code-switching during training. Beyond that, you'll have to look further into the code-switching literature.
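To make the suggestion concrete, here is a minimal sketch of splicing two word-aligned utterances into one code-switched training example with a per-token language ID. It is not code from this repository: the `AlignedWord` and `Utterance` containers and the `splice` helper are hypothetical names, and it assumes word-level alignments in frame units are already available (for example from a forced aligner). Splicing two utterances from the same speaker is probably preferable, so that the language changes while the voice does not, but that is an assumption too.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class AlignedWord:
    phonemes: List[str]  # phoneme tokens of this word
    start: int           # first mel frame of the word (inclusive)
    end: int             # last mel frame of the word (exclusive)


@dataclass
class Utterance:
    lang_id: int                 # integer ID of the utterance's language
    words: List[AlignedWord]     # word-level alignment (hypothetical format)
    frames: List[List[float]]    # mel frames, shape [T][n_mels]


def splice(
    a: Utterance, b: Utterance, cut_a: int, cut_b: int
) -> Tuple[List[str], List[int], List[List[float]]]:
    """Concatenate the first `cut_a` words of `a` with the words of `b`
    from index `cut_b` onward, returning tokens, per-token language IDs,
    and the matching mel frames."""
    tokens: List[str] = []
    lang_ids: List[int] = []
    frames: List[List[float]] = []
    for utt, words in ((a, a.words[:cut_a]), (b, b.words[cut_b:])):
        for word in words:
            tokens.extend(word.phonemes)
            # One language ID per phoneme token, not one per sentence.
            lang_ids.extend([utt.lang_id] * len(word.phonemes))
            frames.extend(utt.frames[word.start:word.end])
    return tokens, lang_ids, frames
```

The `lang_ids` list has the same length as `tokens`, so it can index a language embedding table and be added to the token embeddings position by position, instead of broadcasting a single sentence-level vector. A hard cut at word boundaries may produce audible discontinuities; whether that matters for training would need to be checked empirically.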
