Cuda? #2

Do you have any plans to support the other backends that llama.cpp supports, so that this can be accelerated?
Sorry for the late reply. Yes, I do plan to support CUDA, but because of personal circumstances it might not be implemented for a few months. In the meantime, if you have a GPU, I would suggest using a more mature repo.
Hey @xyzhang626, do you have any resources/pointers/tips on how CUDA is implemented in ggml? Unless I'm missing something, there's basically zero documentation. I've adapted this code to support a slightly different architecture for my needs, but I can't quite figure out how to begin with CUDA. Any help would be appreciated. If I succeed, I could also open a PR into this repo.
Sorry for the late reply @grantbey. Yes, the lack of documentation is one of the biggest challenges for anyone who wants to build something based on ggml. It's really annoying. I think the best way (or the only way) is to refer to a more mature repo built with ggml, e.g. chatglm.cpp.
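For anyone landing here later: the rough pattern those repos follow is ggml's backend API, where you build the graph with `no_alloc = true`, allocate the tensors in a backend-owned buffer (VRAM when the backend is CUDA), and then run the graph on that backend. A minimal sketch is below — it assumes a ggml revision that ships `ggml-backend.h` and `ggml-cuda.h` and a build with CUDA enabled; exact names and headers have moved around between ggml versions, so treat this as orientation rather than a drop-in implementation:

```cpp
// Minimal sketch: running a small ggml graph on the CUDA backend,
// falling back to CPU if CUDA is unavailable. Based on the pattern in
// ggml's backend examples; verify names against your vendored ggml.
#include <vector>
#include "ggml.h"
#include "ggml-backend.h"
#include "ggml-cuda.h"   // requires a CUDA-enabled ggml build

int main() {
    // 1. Create the backend (device 0), falling back to CPU.
    ggml_backend_t backend = ggml_backend_cuda_init(0);
    if (!backend) backend = ggml_backend_cpu_init();

    // 2. Build tensor/graph *metadata* only (no_alloc = true);
    //    the actual data will live in a backend buffer.
    ggml_init_params params = {
        /*.mem_size   =*/ ggml_tensor_overhead() * 16 + ggml_graph_overhead(),
        /*.mem_buffer =*/ nullptr,
        /*.no_alloc   =*/ true,
    };
    ggml_context * ctx = ggml_init(params);

    ggml_tensor * a = ggml_new_tensor_2d(ctx, GGML_TYPE_F32, 4, 4);
    ggml_tensor * b = ggml_new_tensor_2d(ctx, GGML_TYPE_F32, 4, 4);
    ggml_tensor * c = ggml_mul_mat(ctx, a, b);

    ggml_cgraph * graph = ggml_new_graph(ctx);
    ggml_build_forward_expand(graph, c);

    // 3. Allocate every tensor in the context in one backend buffer
    //    (VRAM for the CUDA backend).
    ggml_backend_buffer_t buf = ggml_backend_alloc_ctx_tensors(ctx, backend);

    // 4. Copy input data host -> device.
    std::vector<float> a_data(16, 1.0f), b_data(16, 2.0f);
    ggml_backend_tensor_set(a, a_data.data(), 0, ggml_nbytes(a));
    ggml_backend_tensor_set(b, b_data.data(), 0, ggml_nbytes(b));

    // 5. Run the graph on the backend.
    ggml_backend_graph_compute(backend, graph);

    // 6. Copy the result device -> host.
    std::vector<float> out(ggml_nelements(c));
    ggml_backend_tensor_get(c, out.data(), 0, ggml_nbytes(c));

    ggml_backend_buffer_free(buf);
    ggml_free(ctx);
    ggml_backend_free(backend);
}
```

The useful property of this design is that steps 2–6 are backend-agnostic: swapping `ggml_backend_cuda_init` for another backend's init function is, in principle, the only CUDA-specific line.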
Thanks @xyzhang626! That's sort of what I've been doing. I'll take a look at the example you gave; hopefully it's easier to follow than the ones I've seen elsewhere. (edit: realised I replied from a different account, oops)
Hey @grantbey @grantnebula, maybe you should look at this, which forks this repo, optimizes the code a lot, and supports CUDA!