Next WeNet Roadmap #1683
Comments
From Google's recent USM paper, we can see the following three points: (1) injecting text, (2) simpler pre-training, and (3) text-to-speech intermediate representation. I think these three are the ultimate weapons for speech recognition, whether at the signal level or the text level. The community is also a good way to cooperate on building a big model or paving the road to a new pipeline.
For 2, simpler pre-training: BEST-RQ may be a good start: https://github.com/wenet-e2e/wenet/tree/Mddct-bestrq/wenet/ssl/bestrq
@Mddct shows his insight on the general speech recognition task; it's great.
This issue has been automatically closed due to inactivity. |
We will mainly focus on the following two problems in Next WeNet.
Open source big models + task/private data may be the new paradigm for the next AI. We are open to other proposals. WeNet is a community-driven project and we love your feedback and proposals on where we should be heading. Feel free to volunteer yourself if you are interested in trying out some items (they do not have to be on the list).