Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

support for large scale data training(millions or even tens of millions of hours)? #1505

Closed
brainbpe opened this issue Feb 19, 2024 · 1 comment

Comments

@brainbpe
Copy link

The K2 framework does not support large-scale data (millions or even tens of millions of hours) of ASR training, such as the efficiency of multi-machine and multi-card GPU training is not perfect. Are there any plans to improve this in the future?

@kobenaxie
Copy link
Contributor

Actually we can use lhotse like Nemo-lhotse-dataset to load large scale data.

@JinZr JinZr closed this as completed Feb 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants