Add steps for document of getting dataset 'SF Bilingual Speech' #7378

RobinDong · 2023-09-06T11:20:20Z

What does this PR do ?

When following the document to get dataset SFSpeech Chinese/English Bilingual Speech, the first command

python scripts/dataset_processing/tts/sfbilingual/get_data.py \
    --data-root <your_local_dataset_root> \
    --val-size 0.1 \
    --test-size 0.2 \
    --seed-for-ds-split 100

directly raise error:

Traceback (most recent call last):
  File "/home/xxx/NeMo/scripts/dataset_processing/tts/sfbilingual/get_data.py", line 122, in <module>
    main()
  File "/home/xxx/NeMo/scripts/dataset_processing/tts/sfbilingual/get_data.py", line 116, in main
    __process_data(
  File "/home/xxx/NeMo/scripts/dataset_processing/tts/sfbilingual/get_data.py", line 91, in __process_data
    entries = __process_transcript(dataset_path)
  File "/home/xxx/NeMo/scripts/dataset_processing/tts/sfbilingual/get_data.py", line 65, in __process_transcript
    with open(file_path / "text_SF.txt", encoding="utf-8") as fin:
FileNotFoundError: [Errno 2] No such file or directory: 'tryit/text_SF.txt'

The reason: scripts/dataset_processing/tts/sfbilingual/get_data.py actually doesn't download the dataset. The SFSpeech Chinese/English Bilingual Speech could only be downloaded through Nvidia NGC.

Changelog

Add steps in the document of dataset SFSpeech to download by ngc-cli tool at first

Before your PR is "Ready for review"

Pre checks:

Make sure you read and follow Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

New Feature
Bugfix
Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

@blisc @okuchaiev @titu1994 @XuesongYang

XuesongYang · 2023-09-19T07:37:10Z

Thanks for the fix. I made some changes upon yours.

Signed-off-by: Robin Dong <[email protected]>

added a link from a tutorial demonstrating detailed data prep steps. Signed-off-by: Xuesong Yang <[email protected]>

github-actions bot added the TTS label Sep 6, 2023

RobinDong force-pushed the main branch 2 times, most recently from b752870 to 91160f1 Compare September 13, 2023 03:18

RobinDong force-pushed the main branch from 689cdaa to d2193da Compare September 15, 2023 07:36

blisc requested a review from XuesongYang September 18, 2023 18:28

XuesongYang self-assigned this Sep 19, 2023

XuesongYang approved these changes Sep 19, 2023

View reviewed changes

RobinDong and others added 2 commits September 19, 2023 19:19

Add steps for document of getting dataset 'SF Bilingual Speech'

56d80e6

Signed-off-by: Robin Dong <[email protected]>

Update datasets.rst

eb78d6b

added a link from a tutorial demonstrating detailed data prep steps. Signed-off-by: Xuesong Yang <[email protected]>

RobinDong force-pushed the main branch from 0fce6b0 to eb78d6b Compare September 19, 2023 09:19

blisc approved these changes Sep 19, 2023

View reviewed changes

blisc merged commit b5d4573 into NVIDIA:main Sep 19, 2023
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add steps for document of getting dataset 'SF Bilingual Speech' #7378

Add steps for document of getting dataset 'SF Bilingual Speech' #7378

RobinDong commented Sep 6, 2023 •

edited

Loading

XuesongYang commented Sep 19, 2023

Add steps for document of getting dataset 'SF Bilingual Speech' #7378

Add steps for document of getting dataset 'SF Bilingual Speech' #7378

Conversation

RobinDong commented Sep 6, 2023 • edited Loading

What does this PR do ?

Changelog

Before your PR is "Ready for review"

Who can review?

XuesongYang commented Sep 19, 2023

RobinDong commented Sep 6, 2023 •

edited

Loading