Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Metadata.csv questions #9

Open
CHK-0000 opened this issue Mar 5, 2024 · 3 comments
Open

Metadata.csv questions #9

CHK-0000 opened this issue Mar 5, 2024 · 3 comments

Comments

@CHK-0000
Copy link

CHK-0000 commented Mar 5, 2024

(s) c:\NaturalSpeech2>python C:\NaturalSpeech2\Pattern_Generator.py -lj C:\NaturalSpeech2\dataset\train -hp C:\NaturalSpeech2\Hyper_Parameters.yaml
C:\Users\user.conda\envs\s\lib\site-packages\torch\nn\utils\weight_norm.py:28: UserWarning: torch.nn.utils.weight_norm is deprecated in favor of torch.nn.utils.parametrizations.weight_norm.
warnings.warn("torch.nn.utils.weight_norm is deprecated in favor of torch.nn.utils.parametrizations.weight_norm.")
Traceback (most recent call last):
File "C:\NaturalSpeech2\Pattern_Generator.py", line 1443, in
lj_paths, lj_text_dict, lj_pronunciation_dict, lj_speaker_dict, lj_emotion_dict, lj_language_dict, lj_gender_dict = LJ_Info_Load(path= args.lj_path)
File "C:\NaturalSpeech2\Pattern_Generator.py", line 1042, in LJ_Info_Load
for line in open(os.path.join(path, 'metadata.csv').replace('\', '/'), 'r', encoding= 'utf-8-sig').readlines():
FileNotFoundError: [Errno 2] No such file or directory: 'C:/NaturalSpeech2/dataset/train/metadata.csv'


I'm getting the above error, where can I get the metadata.csv? Please let me know the format of the data. Thanks.

@CODEJIN
Copy link
Owner

CODEJIN commented Mar 5, 2024

Dear @ksl103177 ,

Hello. Is the 'C:\NaturalSpeech2\dataset' you specified where the LJSpeech dataset is located?
metadata.csv is located at the root of the dataset. To be exact, the unzipped LJSpeech dataset consists of one folder and two files as follows.

-wavs
-metadata.csv
-README

If there is no metadata.csv in your LJSpeech dataset, download it again from here and unzip it.

Best regards.

@CHK-0000
Copy link
Author

CHK-0000 commented Mar 5, 2024

Oh, thank you, I have another question.
In the Hyper_Parameters.yaml file, should I use the Path: 'C:\NaturalSpeech2\LJSpeech-1.1\wavs' path in Train_pattern as the train data and the data in Eval_Pattern should be in the same format?

@CODEJIN
Copy link
Owner

CODEJIN commented Sep 6, 2024

No. The train and eval paths are the locations where the training data is generated. Pattern_Generator.py outputs a completely separate pickled file from the dataset and inputs it into the specified folder. Therefore, the train and eval paths should not be LJSpeech-1.1 but rather a new folder should be designated.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants