-
You either need a dataset with emotion annotations and use those annotations as extra inputs to the model, or, if the dataset has no annotations but the speech is expressive, you can run a sentiment classifier over it to create pseudo-labels and train on those. Curious to hear what others say; there may be more alternatives.
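A minimal sketch of the pseudo-labelling idea above. `classify_emotion` is a hypothetical stand-in for a real speech emotion/sentiment classifier (here it just returns fixed scores so the example is self-contained); the emotion vocabulary and metadata keys are illustrative assumptions, not from any specific codebase.

```python
# Pseudo-labelling an unannotated but expressive speech dataset:
# run a classifier over each clip and store the top class as emotion_id.

EMOTIONS = ["neutral", "happy", "sad", "angry"]  # assumed label set

def classify_emotion(audio_path: str) -> dict:
    # Placeholder: a real classifier would return per-class probabilities
    # computed from the audio file at `audio_path`.
    return {"neutral": 0.1, "happy": 0.7, "sad": 0.1, "angry": 0.1}

def add_pseudo_labels(metadata: list) -> list:
    """Attach an emotion_id pseudo-label to each dataset entry in place."""
    for entry in metadata:
        scores = classify_emotion(entry["audio"])
        top = max(scores, key=scores.get)       # most probable emotion
        entry["emotion_id"] = EMOTIONS.index(top)
    return metadata

dataset = [{"audio": "clip_001.wav", "speaker_id": 0}]
labelled = add_pseudo_labels(dataset)
print(labelled[0]["emotion_id"])  # → 1 (index of "happy")
```

The model then trains on `emotion_id` exactly as it would on a hand-annotated label, accepting some classifier noise in exchange for not needing manual annotation.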
-
I'm interested in adding an emotion_id in addition to speaker_id so that, at inference time, I can choose both the speaker and the emotion. The dataset would also have to include this additional parameter (emotion_id). This would let me run inference with a specific speaker and a specific emotion (angry, sad, happy, etc.).
I'm wondering how to go about this. Has something similar been done? Would this be useful to anyone else?
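One common way to do this is to mirror the existing speaker conditioning: give `emotion_id` its own embedding table and combine the two vectors into one conditioning input. This is a sketch under that assumption; the table sizes, dimensions, and function name are illustrative, and a real model would use learned embeddings (e.g. `nn.Embedding` in PyTorch) rather than random lists.

```python
import random

EMB_DIM = 4
N_SPEAKERS, N_EMOTIONS = 3, 4  # e.g. emotions: neutral/happy/sad/angry

# Stand-ins for learned embedding tables, one row per id.
random.seed(0)
speaker_table = [[random.random() for _ in range(EMB_DIM)] for _ in range(N_SPEAKERS)]
emotion_table = [[random.random() for _ in range(EMB_DIM)] for _ in range(N_EMOTIONS)]

def conditioning_vector(speaker_id: int, emotion_id: int) -> list:
    """Concatenate speaker and emotion embeddings into one conditioning vector."""
    return speaker_table[speaker_id] + emotion_table[emotion_id]

# At inference time the two ids are chosen independently:
cond = conditioning_vector(speaker_id=1, emotion_id=3)  # speaker 1, "angry"
print(len(cond))  # → 8, i.e. EMB_DIM * 2
```

Because the two ids are independent inputs, any speaker can be paired with any emotion at inference, provided the training data covered enough speaker/emotion combinations for the model to disentangle them.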