Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

按照困难的安装教程报错 #2140

Closed
LZH-0225 opened this issue Jul 12, 2022 · 3 comments
Closed

按照困难的安装教程报错 #2140

LZH-0225 opened this issue Jul 12, 2022 · 3 comments

Comments

@LZH-0225
Copy link

命令:sudo pip install -e . -i https://pypi.tuna.tsinghua.edu.cn/simple
报错:
Running setup.py develop for paddlespeech
ERROR: Command errored out with exit status 1:
command: /usr/bin/python3 -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'/home/lizhaohui/PaddleSpeech-develop/setup.py'"'"'; file='"'"'/home/lizhaohui/PaddleSpeech-develop/setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(file);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, file, '"'"'exec'"'"'))' develop --no-deps
cwd: /home/lizhaohui/PaddleSpeech-develop/
Complete output (149 lines):
fatal: not a git repository (or any of the parent directories): .git

__version__ = '0.0.0'


__commit__ = ''

write_version_py done
running develop
running egg_info
writing paddlespeech.egg-info/PKG-INFO
writing dependency_links to paddlespeech.egg-info/dependency_links.txt
writing entry points to paddlespeech.egg-info/entry_points.txt
writing requirements to paddlespeech.egg-info/requires.txt
writing top-level names to paddlespeech.egg-info/top_level.txt
reading manifest file 'paddlespeech.egg-info/SOURCES.txt'
writing manifest file 'paddlespeech.egg-info/SOURCES.txt'
running build_ext
Creating /usr/local/lib/python3.8/dist-packages/paddlespeech.egg-link (link to .)
paddlespeech 0.0.0 is already the active version in easy-install.pth
Installing paddlespeech script to /usr/local/bin
Installing paddlespeech_client script to /usr/local/bin
Installing paddlespeech_server script to /usr/local/bin

Installed /home/lizhaohui/PaddleSpeech-develop
Post Install...
apt update -y

WARNING: apt does not have a stable CLI interface. Use with caution in scripts.

命中:1 https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/amd64  InRelease
命中:2 http://ppa.launchpad.net/graphics-drivers/ppa/ubuntu focal InRelease
命中:3 https://download.docker.com/linux/ubuntu focal InRelease
命中:4 https://apt.repos.intel.com/mkl all InRelease
命中:5 http://mirrors.aliyun.com/ubuntu focal InRelease
命中:6 http://mirrors.aliyun.com/ubuntu focal-security InRelease
命中:7 http://mirrors.aliyun.com/ubuntu focal-updates InRelease
命中:8 http://mirrors.aliyun.com/ubuntu focal-proposed InRelease
命中:9 http://mirrors.aliyun.com/ubuntu focal-backports InRelease
正在读取软件包列表...
正在分析软件包的依赖关系树...
正在读取状态信息...
有 396 个软件包可以升级。请执行 ‘apt list --upgradable’ 来查看它们。
apt install -y bc flac jq vim tig tree sox pkg-config libsndfile1 libflac-dev libogg-dev libvorbis-dev libboost-dev swig python3-dev

WARNING: apt does not have a stable CLI interface. Use with caution in scripts.

正在读取软件包列表...
正在分析软件包的依赖关系树...
正在读取状态信息...
bc 已经是最新版 (1.07.1-2build1)。
libboost-dev 已经是最新版 (1.71.0.0ubuntu2)。
libflac-dev 已经是最新版 (1.3.3-1build1)。
libogg-dev 已经是最新版 (1.3.4-0ubuntu1)。
libvorbis-dev 已经是最新版 (1.3.6-2ubuntu1)。
pkg-config 已经是最新版 (0.29.1-0ubuntu4)。
python3-dev 已经是最新版 (3.8.2-0ubuntu2)。
flac 已经是最新版 (1.3.3-1build1)。
sox 已经是最新版 (14.4.2+git20190427-2)。
swig 已经是最新版 (4.0.1-5build1)。
tig 已经是最新版 (2.4.1-1ubuntu1)。
tree 已经是最新版 (1.8.0-1)。
libsndfile1 已经是最新版 (1.0.28-7ubuntu0.1)。
vim 已经是最新版 (2:8.1.2269-1ubuntu5.7)。
jq 已经是最新版 (1.6-1ubuntu0.20.04.1)。
下列软件包是自动安装的并且现在不需要了:
  linux-headers-5.4.0-44 linux-headers-5.4.0-44-generic
  linux-image-5.4.0-44-generic linux-modules-5.4.0-44-generic
  linux-modules-extra-5.4.0-44-generic
使用'sudo apt autoremove'来卸载它(它们)。
升级了 0 个软件包,新安装了 0 个软件包,要卸载 0 个软件包,有 396 个软件包未被升级。
echo "check_certificate = off" >> ~/.wgetrc
touch apt.done
# Ubuntu 16.04 透過 apt 會安裝 boost 1.58.0
# it seems that boost (1.54.0) requires higher version. After I switched to g++-5 it compiles normally.
apt install -y --allow-unauthenticated build-essential cmake libboost-system-dev libboost-thread-dev libboost-program-options-dev libboost-test-dev libeigen3-dev zlib1g-dev libbz2-dev liblzma-dev

WARNING: apt does not have a stable CLI interface. Use with caution in scripts.

正在读取软件包列表...
正在分析软件包的依赖关系树...
正在读取状态信息...
cmake 已经是最新版 (3.16.3-1ubuntu1)。
libboost-program-options-dev 已经是最新版 (1.71.0.0ubuntu2)。
libboost-system-dev 已经是最新版 (1.71.0.0ubuntu2)。
libboost-thread-dev 已经是最新版 (1.71.0.0ubuntu2)。
libbz2-dev 已经是最新版 (1.0.8-2)。
libboost-test-dev 已经是最新版 (1.71.0.0ubuntu2)。
libeigen3-dev 已经是最新版 (3.3.7-2)。
liblzma-dev 已经是最新版 (5.2.4-1ubuntu1.1)。
zlib1g-dev 已经是最新版 (1:1.2.11.dfsg-2ubuntu1.3)。
build-essential 已经是最新版 (12.8ubuntu1.1)。
下列软件包是自动安装的并且现在不需要了:
  linux-headers-5.4.0-44 linux-headers-5.4.0-44-generic
  linux-image-5.4.0-44-generic linux-modules-5.4.0-44-generic
  linux-modules-extra-5.4.0-44-generic
使用'sudo apt autoremove'来卸载它(它们)。
升级了 0 个软件包,新安装了 0 个软件包,要卸载 0 个软件包,有 396 个软件包未被升级。
apt-get install -y gcc-5 g++-5 && update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-5 50  && update-alternatives --install /usr/bin/g++ g++ /usr/bin/g++-5 50
正在读取软件包列表...
正在分析软件包的依赖关系树...
正在读取状态信息...
没有可用的软件包 g++-5,但是它被其它的软件包引用了。
这可能意味着这个缺失的软件包可能已被废弃,
或者只能在其他发布源中找到
然而下列软件包会取代它:
  gcc-10-test-results gcc-9-test-results:i386 gcc-10-test-results:i386
  gcc-9-test-results gcc-8-test-results:i386 gcc-8-test-results
  gcc-7-test-results

没有可用的软件包 gcc-5,但是它被其它的软件包引用了。
这可能意味着这个缺失的软件包可能已被废弃,
或者只能在其他发布源中找到

E: 软件包 gcc-5 没有可安装候选
E: 软件包 g++-5 没有可安装候选
make: *** [Makefile:37: kenlm.done] Error 100
/home/lizhaohui/PaddleSpeech-develop/setup.py:126: CMD: git rev-parse HEAD, Error: b''
/home/lizhaohui/PaddleSpeech-develop/setup.py:113: CMD: make, Error: None
/usr/lib/python3/dist-packages/requests/__init__.py:89: RequestsDependencyWarning: urllib3 (1.26.10) or chardet (3.0.4) doesn't match a supported version!
  warnings.warn("urllib3 ({}) or chardet ({}) doesn't match a supported "
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "/home/lizhaohui/PaddleSpeech-develop/setup.py", line 331, in <module>
    setup(**setup_info)
  File "/usr/lib/python3/dist-packages/setuptools/__init__.py", line 144, in setup
    return distutils.core.setup(**attrs)
  File "/usr/lib/python3.8/distutils/core.py", line 148, in setup
    dist.run_commands()
  File "/usr/lib/python3.8/distutils/dist.py", line 966, in run_commands
    self.run_command(cmd)
  File "/usr/lib/python3.8/distutils/dist.py", line 985, in run_command
    cmd_obj.run()
  File "/home/lizhaohui/PaddleSpeech-develop/setup.py", line 176, in run
    self.execute(_post_install, (self.install_lib, ), msg="Post Install...")
  File "/usr/lib/python3.8/distutils/cmd.py", line 335, in execute
    util.execute(func, args, msg, dry_run=self.dry_run)
  File "/usr/lib/python3.8/distutils/util.py", line 303, in execute
    func(*args)
  File "/home/lizhaohui/PaddleSpeech-develop/setup.py", line 162, in _post_install
    check_call("make")
  File "/home/lizhaohui/PaddleSpeech-develop/setup.py", line 116, in check_call
    raise e
  File "/home/lizhaohui/PaddleSpeech-develop/setup.py", line 107, in check_call
    sp.check_call(
  File "/usr/lib/python3.8/subprocess.py", line 364, in check_call
    raise CalledProcessError(retcode, cmd)
subprocess.CalledProcessError: Command '['make']' returned non-zero exit status 2.
/home/lizhaohui/PaddleSpeech-develop/tools
----------------------------------------

ERROR: Can't roll back paddlespeech; was not uninstalled
ERROR: Command errored out with exit status 1: /usr/bin/python3 -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'/home/lizhaohui/PaddleSpeech-develop/setup.py'"'"'; file='"'"'/home/lizhaohui/PaddleSpeech-develop/setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(file);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, file, '"'"'exec'"'"'))' develop --no-deps Check the logs for full command output.

@yt605155624
Copy link
Collaborator

请仔细看报错信息,没有 gcc 和 g++, 请先安装 :)

@LZH-0225
Copy link
Author

请仔细看报错信息,没有 gcc 和 g++, 请先安装 :)
我有两个问题:1、我想问一下,就是安装paddlespeech需要gcc-5,我把我原来的gcc卸载,换成gcc-5,但是我再去跑pytorch的代码出现这样的问题,请问你们有遇到吗?后来我把gcc升级成gcc-9,仍然会报错,想问一下你们是否遇到此类问题,有没有什么解决方法。
2、还有一个就是,我不太了解docker,可以麻烦更新一下那个困难安装模式的docker的redeme,可以写的更加流程化的吗?感觉这样对新手更加友好一些,谢谢。

Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.
Terminating: fork() called from a process already using GNU OpenMP, this is unsafe.

Error executing job with overrides: ['experiment=audio-classify', 'trainer.gpus=4', '+trainer.strategy=ddp']
Error executing job with overrides: ['experiment=audio-classify', 'trainer.gpus=4', '+trainer.strategy=ddp']
Traceback (most recent call last):
File "train.py", line 24, in main
return train(config)
File "/home/lizhaohui/simpleaudio-audio-classifier/src/training_pipeline.py", line 88, in train
trainer.fit(model=model, datamodule=datamodule)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 770, in fit
self._call_and_handle_interrupt(
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 721, in _call_and_handle_interrupt
return self.strategy.launcher.launch(trainer_fn, *args, trainer=self, **kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/strategies/launchers/subprocess_script.py", line 93, in launch
return function(*args, **kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 811, in _fit_impl
results = self._run(model, ckpt_path=self.ckpt_path)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 1236, in _run
results = self._run_stage()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 1323, in _run_stage
return self._run_train()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 1345, in _run_train
self._run_sanity_check()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 1413, in _run_sanity_check
val_loop.run()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/loops/base.py", line 204, in run
self.advance(*args, **kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/loops/dataloader/evaluation_loop.py", line 154, in advance
dl_outputs = self.epoch_loop.run(self._data_fetcher, dl_max_batches, kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/loops/base.py", line 199, in run
self.on_run_start(*args, **kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/loops/epoch/evaluation_epoch_loop.py", line 88, in on_run_start
self._data_fetcher = iter(data_fetcher)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/utilities/fetching.py", line 178, in iter
self.dataloader_iter = iter(self.dataloader)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 368, in iter
return self._get_iterator()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 314, in _get_iterator
return _MultiProcessingDataLoaderIter(self)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 965, in init
self._reset(loader, first_iter=True)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 996, in _reset
self._try_put_index()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 1241, in _try_put_index
self._index_queues[worker_queue_idx].put((self._send_idx, index))
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/multiprocessing/queues.py", line 88, in put
self._start_thread()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/multiprocessing/queues.py", line 173, in _start_thread
self._thread.start()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/threading.py", line 857, in start
self._started.wait()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/threading.py", line 558, in wait
signaled = self._cond.wait(timeout)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/threading.py", line 302, in wait
waiter.acquire()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/_utils/signal_handling.py", line 66, in handler
_error_if_any_worker_fails()
RuntimeError: DataLoader worker (pid 194526) is killed by signal: Terminated.

Set the environment variable HYDRA_FULL_ERROR=1 for a complete stack trace.
Error executing job with overrides: ['experiment=audio-classify', 'trainer.gpus=4', '+trainer.strategy=ddp']
Error executing job with overrides: ['experiment=audio-classify', 'trainer.gpus=4', '+trainer.strategy=ddp']
Traceback (most recent call last):
File "/home/lizhaohui/simpleaudio-audio-classifier/train.py", line 24, in main
return train(config)
File "/home/lizhaohui/simpleaudio-audio-classifier/src/training_pipeline.py", line 88, in train
trainer.fit(model=model, datamodule=datamodule)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 770, in fit
self._call_and_handle_interrupt(
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 723, in _call_and_handle_interrupt
return trainer_fn(*args, **kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 811, in _fit_impl
results = self._run(model, ckpt_path=self.ckpt_path)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 1236, in _run
results = self._run_stage()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 1323, in _run_stage
return self._run_train()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 1345, in _run_train
self._run_sanity_check()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 1413, in _run_sanity_check
val_loop.run()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/loops/base.py", line 204, in run
self.advance(*args, **kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/loops/dataloader/evaluation_loop.py", line 154, in advance
dl_outputs = self.epoch_loop.run(self._data_fetcher, dl_max_batches, kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/loops/base.py", line 199, in run
self.on_run_start(*args, **kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/loops/epoch/evaluation_epoch_loop.py", line 88, in on_run_start
self._data_fetcher = iter(data_fetcher)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/utilities/fetching.py", line 178, in iter
self.dataloader_iter = iter(self.dataloader)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 368, in iter
return self._get_iterator()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 314, in _get_iterator
return _MultiProcessingDataLoaderIter(self)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 965, in init
self._reset(loader, first_iter=True)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 996, in _reset
self._try_put_index()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 1241, in _try_put_index
self._index_queues[worker_queue_idx].put((self._send_idx, index))
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/multiprocessing/queues.py", line 88, in put
self._start_thread()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/multiprocessing/queues.py", line 173, in _start_thread
self._thread.start()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/threading.py", line 857, in start
self._started.wait()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/threading.py", line 558, in wait
signaled = self._cond.wait(timeout)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/threading.py", line 302, in wait
waiter.acquire()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/_utils/signal_handling.py", line 66, in handler
_error_if_any_worker_fails()
RuntimeError: DataLoader worker (pid 195006) is killed by signal: Terminated.

Set the environment variable HYDRA_FULL_ERROR=1 for a complete stack trace.
Traceback (most recent call last):
File "/home/lizhaohui/simpleaudio-audio-classifier/train.py", line 24, in main
return train(config)
File "/home/lizhaohui/simpleaudio-audio-classifier/src/training_pipeline.py", line 88, in train
trainer.fit(model=model, datamodule=datamodule)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 770, in fit
self._call_and_handle_interrupt(
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 723, in _call_and_handle_interrupt
return trainer_fn(*args, **kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 811, in _fit_impl
results = self._run(model, ckpt_path=self.ckpt_path)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 1236, in _run
results = self._run_stage()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 1323, in _run_stage
return self._run_train()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 1345, in _run_train
self._run_sanity_check()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 1413, in _run_sanity_check
val_loop.run()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/loops/base.py", line 204, in run
self.advance(*args, **kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/loops/dataloader/evaluation_loop.py", line 154, in advance
dl_outputs = self.epoch_loop.run(self._data_fetcher, dl_max_batches, kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/loops/base.py", line 199, in run
self.on_run_start(*args, **kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/loops/epoch/evaluation_epoch_loop.py", line 88, in on_run_start
self._data_fetcher = iter(data_fetcher)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/utilities/fetching.py", line 178, in iter
self.dataloader_iter = iter(self.dataloader)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 368, in iter
return self._get_iterator()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 314, in _get_iterator
return _MultiProcessingDataLoaderIter(self)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 965, in init
self._reset(loader, first_iter=True)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 996, in _reset
self._try_put_index()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 1241, in _try_put_index
self._index_queues[worker_queue_idx].put((self._send_idx, index))
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/multiprocessing/queues.py", line 88, in put
self._start_thread()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/multiprocessing/queues.py", line 173, in _start_thread
self._thread.start()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/threading.py", line 857, in start
self._started.wait()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/threading.py", line 558, in wait
signaled = self._cond.wait(timeout)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/threading.py", line 302, in wait
waiter.acquire()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/_utils/signal_handling.py", line 66, in handler
_error_if_any_worker_fails()
RuntimeError: DataLoader worker (pid 194607) is killed by signal: Terminated.

Set the environment variable HYDRA_FULL_ERROR=1 for a complete stack trace.
Traceback (most recent call last):
File "/home/lizhaohui/simpleaudio-audio-classifier/train.py", line 24, in main
return train(config)
File "/home/lizhaohui/simpleaudio-audio-classifier/src/training_pipeline.py", line 88, in train
trainer.fit(model=model, datamodule=datamodule)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 770, in fit
self._call_and_handle_interrupt(
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 723, in _call_and_handle_interrupt
return trainer_fn(*args, **kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 811, in _fit_impl
results = self._run(model, ckpt_path=self.ckpt_path)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 1236, in _run
results = self._run_stage()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 1323, in _run_stage
return self._run_train()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 1345, in _run_train
self._run_sanity_check()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 1413, in _run_sanity_check
val_loop.run()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/loops/base.py", line 204, in run
self.advance(*args, **kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/loops/dataloader/evaluation_loop.py", line 154, in advance
dl_outputs = self.epoch_loop.run(self._data_fetcher, dl_max_batches, kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/loops/base.py", line 199, in run
self.on_run_start(*args, **kwargs)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/loops/epoch/evaluation_epoch_loop.py", line 88, in on_run_start
self._data_fetcher = iter(data_fetcher)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/pytorch_lightning/utilities/fetching.py", line 178, in iter
self.dataloader_iter = iter(self.dataloader)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 368, in iter
return self._get_iterator()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 314, in _get_iterator
return _MultiProcessingDataLoaderIter(self)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 965, in init
self._reset(loader, first_iter=True)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 996, in _reset
self._try_put_index()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 1241, in _try_put_index
self._index_queues[worker_queue_idx].put((self._send_idx, index))
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/multiprocessing/queues.py", line 88, in put
self._start_thread()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/multiprocessing/queues.py", line 173, in _start_thread
self._thread.start()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/threading.py", line 857, in start
self._started.wait()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/threading.py", line 558, in wait
signaled = self._cond.wait(timeout)
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/threading.py", line 302, in wait
waiter.acquire()
File "/home/lizhaohui/miniconda3/envs/py38/lib/python3.8/site-packages/torch/utils/data/_utils/signal_handling.py", line 66, in handler
_error_if_any_worker_fails()
RuntimeError: DataLoader worker (pid 194527) is killed by signal: Terminated.
​​​
Shift + Enter 换行

@yt605155624
Copy link
Collaborator

看起来程序想用 4 块 GPU 跑程序,你机器上是否没有足够的 GPU? 可以根据自己的机器情况调整这些参数

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants