-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Vulkan #15
Vulkan #15
Conversation
I can no longer reproduce the build-time issue I had before.. strange. It seems to just work? I also added the three vulkan dependencies to the default nix derivation's |
ggerganov/llama.cpp#9582 I will add |
Running all tests concurrently crashes in CI as well as on my machine locally. I added Now for a strange one: the tests run in the CI runner, but I get different token outputs on the CI runner than I do on my machine locally. |
Ok so, I set up the nix derivation to also run unit tests (inside the nix build sandbox), using a software renderer from mesa. It reproduces the same error that we see in github actions. As in, the exact same nonsense tokens. Run |
OOOhkay- I think I have a lead. It has to do with over-allocating the "GPU" It runs fine and passes both tests if I set How do we address this? Can we ask the system for how much VRAM is available, and estimate how many layers fit? |
I think that the reason I'm not hitting this locally is just that I have plenty of VRAM for the small models we run on my machine. |
But it's not just a question of trying to load a model that's bigger than the VRAM available. I tried loading Gemma 2 27B - which is way bigger than the VRAM I have on my laptop - but it just fills my GPU and then offloads the rest to my system RAM. That's using Vulkan w/ my AMD 7700S. So maybe the error is specifically when using a specific software renderer? |
Maybe we should addd the following to
Setting the Vulkan features requires that a VulkanSDK in found for MacOS builds. That is, However, I am not sure whether that is blocked by rust-lang/cargo#1197 |
Libraries for the gd.extension should be updated to reflect the new folder structure as well. Updated: otherwise cool, lgtm! |
This PR adds the vulkan feature to the llama-cpp-2 depdendency.
It also adds a few vulkan dependencies to the
shell.nix
environment.