Skip to content
This repository has been archived by the owner on Oct 9, 2023. It is now read-only.

TTS speech Generation #1113

Open
flozi00 opened this issue Jan 12, 2022 · 8 comments
Open

TTS speech Generation #1113

flozi00 opened this issue Jan 12, 2022 · 8 comments
Assignees
Labels
enhancement New feature or request help wanted Extra attention is needed
Milestone

Comments

@flozi00
Copy link
Contributor

flozi00 commented Jan 12, 2022

🚀 Feature

Motivation

Pitch

Alternatives

One of very good providers is coqiu I think

Additional context

@flozi00 flozi00 added enhancement New feature or request help wanted Extra attention is needed labels Jan 12, 2022
@ethanwharris
Copy link
Collaborator

Hey @flozi00 thanks for the suggestion! Would you be interested in trying to contribute this task to Flash? We can help you out if there's anything you need 😃

@stale
Copy link

stale bot commented Mar 20, 2022

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the won't fix This will not be worked on label Mar 20, 2022
@stale stale bot closed this as completed Apr 16, 2022
@ethanwharris ethanwharris removed the won't fix This will not be worked on label Apr 17, 2022
@ethanwharris ethanwharris added this to the 0.8.0 milestone Apr 17, 2022
@ethanwharris ethanwharris reopened this Apr 17, 2022
@joowon-dm-snu
Copy link

hello @ethanwharris, Do you think TTS can be implemented soon?

@ethanwharris ethanwharris modified the milestones: 0.8.0, 0.9.0 Jun 29, 2022
@uakarsh
Copy link
Contributor

uakarsh commented Jul 6, 2022

Hi @ethanwharris, I am currently interested in implementing Deep Learning Models (especially multi-modal transformers, my recent works are here). So, would you mind, if I can take a look on the topic of TTS Speech Generation and models (since I want to explore a new domain and Speech Recognition would be amazing to explore with) and get back to you?

@krshrimali
Copy link
Contributor

Hi @ethanwharris, I am currently interested in implementing Deep Learning Models (especially multi-modal transformers, my recent works are here). So, would you mind, if I can take a look on the topic of TTS Speech Generation and models (since I want to explore a new domain and Speech Recognition would be amazing to explore with) and get back to you?

Hi, @uakarsh - great hearing from you! Thank you for showing interest. The team is on a company holiday for this week, so we are sorry if we were slow to respond but please go ahead and explore this issue. More than happy to see where this goes, I've assigned this issue to you. Please reach out in case you need any help. :)

@uakarsh
Copy link
Contributor

uakarsh commented Jul 6, 2022

Awesome then, looking forward to contributing something amazing to Flash

@uakarsh
Copy link
Contributor

uakarsh commented Jul 9, 2022

Hi @ethanwharris @krshrimali, I have been exploring Audio Processing and TTS from past few days. I think, there are a few things like: audio Transformations (similar to torchvision.transforms), different models.

How about, if we integrate this to Flash? It would help in loading any type of Audio Dataset with Text and train it/fine-tune it. (I guess, I have to write it entirely again, in order to allow the users to properly use it)

Although, there are a many types of models out there, but this was an end-to-end model, so definitely thought of pitching it. In coming time, we can try to add more models (not sure about how to integrate Hugging Face models, since I was not able to find proper scripts to train, but would search more).

@stale
Copy link

stale bot commented Sep 22, 2022

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the won't fix This will not be worked on label Sep 22, 2022
@ethanwharris ethanwharris removed the won't fix This will not be worked on label Oct 1, 2022
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

5 participants