-
Notifications
You must be signed in to change notification settings - Fork 369
Conversation
Aha, I was just about to reply to the BLOOM PR! I would suggest keeping BLOOM in - there's no reason to remove that code if someone can fix it with some work (we'll make an issue for it) - but we'll remove it from the list of features, and make a note on the crate itself. As for the rest, let me have a look... |
@@ -1,39 +1,31 @@ | |||
# LLaMA-rs | |||
|
|||
<!-- markdownlint-disable-file MD026 --> | |||
This project is a Rust port of | |||
[llama.cpp](https://github.com/ggerganov/llama.cpp) 🦙🦀🚀 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm thinking we'll remove this wording as we grow to accommodate more LLMs. I'll revise the wording on this after this PR lands, so nothing for you to do here - just mentioning it.
I don't see the value of keeping non-working code in the main branch. I would suggest keeping BLOOM in a feature branch until it's working. |
btw could you add me as a git commit author rather than a mention as you did when you upstreamed the ggml changes. @danforbes |
Code that isn't kept in the main branch tends to rot, especially if we start making larger changes. As far as I know, BLOOM is done, we just need to figure out what's causing it to fail generation. It's much easier for someone to pick it up and fix it if it's present; I think it's also a good way to ensure our abstractions work as expected as we increase the number of models. Apart from that, PR looks great! Well done :) Looking forward to merging this, I think it's a huge improvement. |
I personally think we should just merge and place a comment on the issue of it not working. I'd pick it up again but I don't have time for it till next week. |
Co-authored-by: @hhamud <[email protected]>
I've added back the BLOOM crate and tagged @hhamud as a co-author ✅ |
All that's left are the formatting and restoring bindings generation. Are you OK for me to do that, or do you have changes in the works? |
Please go for it - thank you so much 👯 |
Replaces #141. Includes code from #85 and #125.
llm
,llm-base
, model, andllm-cli
crates