Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

himia dataset #991

Merged
merged 6 commits into from
Mar 9, 2023
Merged

himia dataset #991

merged 6 commits into from
Mar 9, 2023

Conversation

glynpu
Copy link
Contributor

@glynpu glynpu commented Mar 8, 2023

Hi, I am building a wake word recipe in icefall for this dataset.
Hope for your review for this data preparation part. @pzelasko

pzelasko
pzelasko previously approved these changes Mar 8, 2023
Copy link
Collaborator

@pzelasko pzelasko left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, I left one comment. Could you also add an entry in the table in docs/corpus.rst?

output_dir = Path(output_dir)
output_dir.mkdir(parents=True, exist_ok=True)
manifests = defaultdict(dict)
train_dev_dataset_parts = ["train", "dev", "test", "cw_test"]
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You might want to follow how eg LibriSpeech recipe does auto-discovery of the available parts in case somebody only needs a subset.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated according to your comments. Hope for your further reviews.

Copy link
Collaborator

@pzelasko pzelasko left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks!

@pzelasko pzelasko merged commit 62cd184 into lhotse-speech:master Mar 9, 2023
@pzelasko pzelasko added this to the v1.13 milestone Mar 9, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants