-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
inference failed #10
Comments
Thank you for your interest in our work! Could you provide more detailed runtime parameters and code? It would be even better if you could provide more complete error messages. |
Perhaps you could check the versions of torch, torchaudio, and librosa. |
now it has a new error , i just want to generate a song with my prompt , Waveform inference save path: ./log/latent_diffusion/quick_start/quick_start/infer_01-20-13:53_cfg_scale_3_ddim_100_n_cand_3 |
Thank you for your question! You need to modify the path for loading the melody_npy in ddpm.py, specifically: melody_npy = np.load("your_path/melody.npy"). You can download the corresponding weight files at https://huggingface.co/ManzhenWei/MG2/tree/main. I will add this step to the readme. |
thank your reply,now it has a new error, Waveform inference save path: ./log/latent_diffusion/quick_start/quick_start/infer_01-20-14:53_cfg_scale_3_ddim_100_n_cand_3 |
1 similar comment
thank your reply,now it has a new error, Waveform inference save path: ./log/latent_diffusion/quick_start/quick_start/infer_01-20-14:53_cfg_scale_3_ddim_100_n_cand_3 |
The issue occurred in the |
the same error is also there, Plotting: Restored training weights and i think the code "save_waveform" have problems ,
it output (3, 1, 163872) |
any reply ? why the waveform.shape is 3 , so it generate three audio pieces ? |
when i prepare the checkpoint and the env ,when is run bash quick_start.sh ,there is an error ,
SEED EVERYTHING TO 0
Seed set to 0
Add-ons: []
Traceback (most recent call last):
File "/data/tts/music/Awesome-Music-Generation/MMGen_train/infer.py", line 141, in
infer(dataset_json, config_yaml, config_yaml_path, exp_group_name, exp_name)
File "/data/tts/music/Awesome-Music-Generation/MMGen_train/infer.py", line 39, in infer
val_dataset = AudioDataset(
File "/data/tts/music/Awesome-Music-Generation/MMGen_train/utilities/data/dataset.py", line 78, in init
self.build_dsp()
File "/data/tts/music/Awesome-Music-Generation/MMGen_train/utilities/data/dataset.py", line 283, in build_dsp
self.STFT = Audio.stft.TacotronSTFT(
File "/data/tts/music/Awesome-Music-Generation/MMGen_train/utilities/audio/stft.py", line 144, in init
self.stft_fn = STFT(filter_length, hop_length, win_length)
File "/data/tts/music/Awesome-Music-Generation/MMGen_train/utilities/audio/stft.py", line 42, in init
fft_window = pad_center(fft_window, filter_length)
TypeError: pad_center() takes 1 positional argument but 2 were given
The text was updated successfully, but these errors were encountered: