Feature request - Finetuning or Pretraining for Urdu #414

Closed
hunzlausman opened this issue Jan 17, 2024 · 11 comments
Labels
enhancement New feature or request

Comments

@hunzlausman

Can I fine-tune this model on an Urdu dataset, or train it from scratch with the same architecture for Urdu?
In short, please explain the architecture of this model and why it is so fast.

@hunzlausman hunzlausman added the enhancement New feature or request label Jan 17, 2024
@snakers4
Owner

This year we will probably share a fine-tuning recipe

@hunzlausman
Author

hunzlausman commented Jan 17, 2024 via email

@filtercodes
Contributor

Any news on this? For me this model turned out to be of little use without fine-tuning, because I get too many false positives and that makes the system unstable. I will stick to other options until the fine-tuning recipe is available.

@hunzlausman
Author

hunzlausman commented Mar 17, 2024 via email

@snakers4
Owner

As a first step, we released the dataset: https://github.com/snakers4/silero-vad/tree/master/datasets

@markjosims

This is exciting news. I'm looking forward to the fine-tuning scripts being released.

@snakers4
Owner

The new VAD version was released just now - #2 (comment)

It supports more than 6,000 languages now

Fine-tuning code will be released soon

@hunzlausman
Author

Please, we need this for speech-to-text, not VAD.

@filtercodes
Contributor

filtercodes commented Jun 27, 2024

> The new VAD version was released just now - #2 (comment)
>
> It supports more than 6,000 languages now
>
> Fine-tuning code will be released soon

Looking forward to trying it out! Is the state in/out basically c and h combined? I see it's a tensor of float32[2,?,128], while c and h were float32[2,batch,64] each.
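The shapes in this question are consistent with h and c being joined along the last axis (64 + 64 = 128), although nothing in this thread confirms that this is what the model actually does. A minimal pure-Python sketch of that shape arithmetic follows. All helper names here are hypothetical, and no silero-vad code is involved:

```python
# Hypothetical illustration of the shape arithmetic only.
# It does NOT reflect confirmed silero-vad internals.
LAYERS, BATCH, HIDDEN = 2, 1, 64


def zeros(layers, batch, width):
    """Build a nested list shaped [layers][batch][width], filled with 0.0."""
    return [[[0.0] * width for _ in range(batch)] for _ in range(layers)]


def shape(tensor):
    """Return the (layers, batch, width) shape of a nested list."""
    return (len(tensor), len(tensor[0]), len(tensor[0][0]))


def concat_last_axis(h, c):
    """Join two [layers][batch][width] tensors along the last axis."""
    return [
        [h_row + c_row for h_row, c_row in zip(h_layer, c_layer)]
        for h_layer, c_layer in zip(h, c)
    ]


def split_last_axis(state, width):
    """Inverse of concat_last_axis: split the last axis at index `width`."""
    h = [[row[:width] for row in layer] for layer in state]
    c = [[row[width:] for row in layer] for layer in state]
    return h, c


h = zeros(LAYERS, BATCH, HIDDEN)   # old-style h: [2, batch, 64]
c = zeros(LAYERS, BATCH, HIDDEN)   # old-style c: [2, batch, 64]
state = concat_last_axis(h, c)     # assumed layout: [2, batch, 128]
print(shape(state))                # (2, 1, 128)
```

If the combined state really is laid out this way, `split_last_axis(state, 64)` would recover the two halves. Which half would be h and which c is also an assumption.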

@filtercodes
Contributor

Yes, it doesn't work with the Python script. I'm getting this error:

[Image: Screenshot 2024-06-28 at 09 31 49]

@varrerohit

Do we have any update on the finetuning version?
