-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix apt repo issue for docker #3823
Conversation
Docker builds relating with nightly will fail at the moment since pytorch has released 1.7 rc version. |
dockers/base-cuda/Dockerfile
Outdated
@@ -34,7 +34,7 @@ SHELL ["/bin/bash", "-c"] | |||
|
|||
ENV PATH="$PATH:/root/.local/bin" | |||
|
|||
RUN apt-get update && apt-get install -y --no-install-recommends \ | |||
RUN apt-get clean && apt-get update && apt-get install -y --no-install-recommends \ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This seems strange, you need update before install not clean..
RUN apt-get clean && apt-get update && apt-get install -y --no-install-recommends \ | |
RUN apt-get update && apt-get install -y --no-install-recommends \ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Turns out it's CDN issue, some suggest clean and update
but let's wait and see if CDN update can solve this
NVIDIA/nvidia-docker#1392
NVIDIA/nvidia-docker#877
NVIDIA/nvidia-docker#1328
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ok, the problem is in
Reading package lists...
E: Failed to fetch https://developer.download.nvidia.com/compute/machine-learning/repos/ubuntu1804/x86_64/Packages.gz File has unexpected size (47871 != 49498). Mirror sync in progress? [IP: 152.195.19.142 443]
Hashes of expected file:
- Filesize:49498 [weak]
- SHA256:332f3ee4e353b8a5e5a2bdd8fdbd47cf140c73822b82b328815f122e09e195a0
- SHA1:4dc8ef9a3ee3c97b3c26d46e07fdd83997e6880b [weak]
- MD5Sum:bbff3b9c3462257479d72521ee78ec29 [weak]
Release file created at: Wed, 23 Sep 2020 22:09:13 +0000
E: Some index files failed to download. They have been ignored, or old ones used instead.
not the update step...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That comes from this, I think
Get:10 https://developer.download.nvidia.com/compute/machine-learning/repos/ubuntu1804/x86_64 Packages [49.5 kB]
Err:10 https://developer.download.nvidia.com/compute/machine-learning/repos/ubuntu1804/x86_64 Packages
File has unexpected size (47871 != 49498). Mirror sync in progress? [IP: 152.195.19.142 443]
Hashes of expected file:
- Filesize:49498 [weak]
- SHA256:332f3ee4e353b8a5e5a2bdd8fdbd47cf140c73822b82b328815f122e09e195a0
- SHA1:4dc8ef9a3ee3c97b3c26d46e07fdd83997e6880b [weak]
- MD5Sum:bbff3b9c3462257479d72521ee78ec29 [weak]
Release file created at: Wed, 23 Sep 2020 22:09:13 +0000
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
so skip of the update
is just hiding the NVIDIA package issue not solving it.. :[
dockers/base-xla/Dockerfile
Outdated
@@ -28,7 +28,7 @@ ENV CONDA_ENV=lightning | |||
# show system inforation | |||
RUN lsb_release -a && cat /etc/*-release | |||
|
|||
RUN apt-get update && \ | |||
RUN apt-get clean && apt-get update && \ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
RUN apt-get clean && apt-get update && \ | |
RUN apt-get update && \ |
dockers/base-cuda/Dockerfile
Outdated
@@ -34,7 +34,7 @@ SHELL ["/bin/bash", "-c"] | |||
|
|||
ENV PATH="$PATH:/root/.local/bin" | |||
|
|||
RUN apt-get update && apt-get install -y --no-install-recommends \ | |||
RUN apt-get clean && apt-get update && apt-get install -y --no-install-recommends \ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ok, the problem is in
Reading package lists...
E: Failed to fetch https://developer.download.nvidia.com/compute/machine-learning/repos/ubuntu1804/x86_64/Packages.gz File has unexpected size (47871 != 49498). Mirror sync in progress? [IP: 152.195.19.142 443]
Hashes of expected file:
- Filesize:49498 [weak]
- SHA256:332f3ee4e353b8a5e5a2bdd8fdbd47cf140c73822b82b328815f122e09e195a0
- SHA1:4dc8ef9a3ee3c97b3c26d46e07fdd83997e6880b [weak]
- MD5Sum:bbff3b9c3462257479d72521ee78ec29 [weak]
Release file created at: Wed, 23 Sep 2020 22:09:13 +0000
E: Some index files failed to download. They have been ignored, or old ones used instead.
not the update step...
Codecov Report
@@ Coverage Diff @@
## master #3823 +/- ##
=======================================
- Coverage 84% 80% -4%
=======================================
Files 111 111
Lines 8793 9379 +586
=======================================
+ Hits 7362 7515 +153
- Misses 1431 1864 +433 |
Is it okay to use CUDA 11? |
Downgrading to ubuntu 16.04 works fine |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
pls, no 5 years old OS
# FROM nvidia/cuda:${CUDA_VERSION}-devel | ||
# FROM nvidia/cuda:${CUDA_VERSION}-devel-ubuntu20.04 | ||
# FROM nvidia/cuda:${CUDA_VERSION}-cudnn${CUDNN_VERSION}-devel-ubuntu18.04 | ||
FROM nvidia/cuda:${CUDA_VERSION}-cudnn${CUDNN_VERSION}-devel-ubuntu16.04 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is bad, this is too. old OS version, this is the almost unsupported OS (ends in April 2021)
What does this PR do?
Fixes # (issue)
Before submitting
PR review
Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in Github issues there's a high chance it will not be merged.
Did you have fun?
Make sure you had fun coding 🙃