Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Efficiently Setting Up Dependencies for Karpathy's llm.c on Kaggle: Seeking Guidance #1416

Open
dewijones92 opened this issue Aug 1, 2024 · 0 comments
Assignees
Labels
new-package Requests for installing new packages

Comments

@dewijones92
Copy link

Hello Kaggle! 😊

I wasn't sure where the best place to ask something like this was, so I hope it's okay to open this GitHub issue.

Objective

I want to compile and run Andrej Karpathy’s code from this repository.

Requirements

To achieve this, I need to install the following dependencies:

sudo apt-get -y install nvidia-cuda-toolkit
sudo apt install libnccl2 libnccl-dev
sudo apt-get update
sudo apt-get install libnvtoolsext1
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/cuda-keyring_1.1-1_all.deb
sudo dpkg -i cuda-keyring_1.1-1_all.deb
sudo apt-get update
sudo apt-get -y install nvidia-cuda-toolkit  
sudo apt-get -y install libcudnn9-dev-cuda-12
sudo apt install libnccl2 libnccl-dev
pip install tiktoken  # This is used to tokenize the data to be passed into llm.c

Additionally, the instructions specify that I need to clone this repository into my home directory.

Dockerfile Concerns

I have looked into adding the above dependencies into the Dockerfile.tmpl, but I found the following lines in there:

# Make sure we are on the right version of CUDA
RUN update-alternatives --set cuda /usr/local/cuda-$CUDA_MAJOR_VERSION.$CUDA_MINOR_VERSION
# NVIDIA binaries from the host are mounted to /opt/bin.
ENV PATH=/opt/bin:${PATH}

It seems like the convention is NOT to use apt-get install here. Am I able to install the above dependencies (e.g., nvidia-cuda-toolkit, etc.) in this Dockerfile? What are your thoughts?

Current Issue

At the moment, I have to install all the dependencies each time I start up the notebook, which is incredibly slow and tedious, as you can imagine.

Compilation Method

I am using a method similar to this notebook to compile and run Karpathy’s code.

Thanks!

Thank you so much for Kaggle! 😄😄

@calderjo calderjo self-assigned this Oct 2, 2024
@calderjo calderjo added the new-package Requests for installing new packages label Oct 2, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
new-package Requests for installing new packages
Projects
None yet
Development

No branches or pull requests

2 participants