Common toolbox issues and how to fix them #431
NoBackendError

Summary
Typically fixed by installing ffmpeg. Windows users should follow these instructions: https://video.stackexchange.com/a/20496 If you still experience NoBackendError after installing ffmpeg, try the instructions below and get support at librosa if needed.

More information
This error message occurs when opening an mp3 file. Audioread (a dependency of librosa) needs additional software to open mp3 files. The audioread section of the librosa README may be helpful: https://github.com/librosa/librosa#audioread
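To confirm that ffmpeg is actually visible to Python after installing it, a minimal sketch like the following can help. It assumes that audioread finds ffmpeg through the PATH environment variable; the helper name `find_ffmpeg` is hypothetical and not part of the toolbox.

```python
import shutil


def find_ffmpeg(executable="ffmpeg"):
    """Return the full path of the executable, or None if it is not on PATH."""
    # shutil.which searches PATH the same way the shell would.
    return shutil.which(executable)


if __name__ == "__main__":
    location = find_ffmpeg()
    if location is None:
        print("ffmpeg was not found on PATH; NoBackendError is likely for mp3 files")
    else:
        print("ffmpeg found at", location)
```

If this reports that ffmpeg is missing even though it is installed, the install directory is probably not on PATH for the environment that runs the toolbox.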
All questions about Colab Notebook

The Colab Notebook is a community-developed resource to enable users to run the toolbox without having a GPU or going through a complicated setup. Recently, these issues have been resolved with CPU support being added (#366) and the installation process streamlined (#375). We recommend that you use a normal Python environment.

Users who still prefer using Colab Notebook should understand that no official support will be provided, though you are welcome to ask questions to get help from the community. If you believe you've found a bug with the underlying toolbox code, please try to replicate the issue in a normal Python environment. We are not Colab Notebook users and are unable to troubleshoot Colab Notebook errors.
No module named 'tensorflow.contrib'

Summary
The toolbox requires Tensorflow 1.15 and this error message occurs when Tensorflow 2.x is installed.

Solution
Install Tensorflow 1.15. If you get a pip error
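To see which Tensorflow version is installed before launching the toolbox, a small check along these lines may be useful. This is only a sketch: the helper `is_supported_tensorflow` is a hypothetical name, and it assumes that any 1.15.x release is acceptable.

```python
def is_supported_tensorflow(version):
    """Return True if the version string names a 1.15.x release."""
    parts = version.split(".")
    return len(parts) >= 2 and parts[0] == "1" and parts[1] == "15"


if __name__ == "__main__":
    try:
        import tensorflow as tf
    except ImportError:
        print("Tensorflow is not installed in this environment")
    else:
        # tf.__version__ is a plain string such as "1.15.0" or "2.3.1".
        status = "OK" if is_supported_tensorflow(tf.__version__) else "unsupported"
        print("Tensorflow", tf.__version__, "->", status)
```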
GPU support for the toolbox

Configuring GPU support for the toolbox is difficult. Fortunately, the toolbox can run on the CPU. Download the

If you must have GPU support, questions about CUDA installation should be submitted to a different support channel. Try asking your question in the CUDA setup and installation section of the NVIDIA developer forums: https://forums.developer.nvidia.com/c/accelerated-computing/cuda/cuda-setup-and-installation
Pip error:
OSError: [WinError 193] %1 is not a valid Win32 application

Summary
This error message occurs when you have multiple Python environments that conflict with each other. For example, this is the traceback for #163:
From the traceback you can see these Python paths conflicting with each other:

Solution
Look at your traceback and identify the conflicting Python environments. Then check your
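As a starting point for spotting conflicting environments, the sketch below lists the entries of `sys.path` that fall outside the running interpreter's own installation. The helper `foreign_path_entries` is hypothetical, and the idea that such entries point at the conflict is an assumption; the traceback remains the authoritative source.

```python
import os
import sys


def _norm(path):
    # Normalise case and separators so comparisons also behave on Windows.
    return os.path.normcase(os.path.abspath(path))


def foreign_path_entries(paths, prefixes):
    """Return the entries of `paths` that are not inside any of `prefixes`."""
    roots = [_norm(p) for p in prefixes]
    foreign = []
    for entry in paths:
        if not entry:
            continue  # an empty entry stands for the current directory
        candidate = _norm(entry)
        inside = any(
            candidate == root or candidate.startswith(root + os.sep)
            for root in roots
        )
        if not inside:
            foreign.append(entry)
    return foreign


if __name__ == "__main__":
    print("Interpreter:", sys.executable)
    for entry in foreign_path_entries(sys.path, [sys.prefix, sys.base_prefix]):
        print("Outside this environment:", entry)
```

The script's own directory normally shows up in the output and can be ignored. Entries that belong to a second Python installation are the ones worth investigating.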
Invalid syntax
How to train your own models

First, we do not provide any official support for training a model with your own data. It is expected that anyone who trains a model is capable of coding in Python and solving the inevitable error messages on their own. You are welcome to open an issue if you get stuck, but no one will walk you through the entire process.

Most users will want to train a synthesizer model. In most cases the pretrained encoder and vocoder can be reused.

1. Practice training with LibriSpeech
Corentin has a wiki page for replicating the training of the pretrained models: https://github.com/CorentinJ/Real-Time-Voice-Cloning/wiki/Training
I recommend that you work through the preprocessing and training steps for the synthesizer, using the LibriSpeech

2. Dataset preparation
Assembling the dataset is perhaps the hardest part for most users. For training with your own data, you will need to get your dataset into this format: #437 (comment)
Once you have successfully preprocessed the data, the training commands will work as before.

Additional notes

Finetuning a single-speaker model
If you do not require a multi-speaker model, use the process in #437 to finetune the existing models to a single speaker. This can be done in a reasonable amount of time on CPU.

Considerations - languages other than English
For over 2 months, there has been nothing asked frequently enough to update the FAQ and the support questions have subsided, so there may not be a need for this anymore. Though I am closing this, the contents are still searchable.
Is it possible to get a phoneme printed output live? (even inaccurate)
This issue will be used to document common toolbox issues and how to fix them. Please do not reply here to keep the signal/noise high. Instead, report problems and suggest additions/improvements by opening a new issue.