Add support for BERT embedding models #5423
Merged
+616 −52
Changes from 18 commits (21 commits total)
Commits
7286b83  BERT WIP (cebtenzzre)
ef10d78  merge from master (iamlemec)
0051c82  it runs; tokenization is messed up; pooling is wrong for multi batches (iamlemec)
59c1829  add in wordpiece tokenizer (iamlemec)
5f1c21d  put causal_attn flag in gguf (iamlemec)
e0e14e3  Merge remote-tracking branch 'origin/master' into bert (iamlemec)
7218c7b  Merge remote-tracking branch 'upstream/master' into bert (iamlemec)
e3efcf1  Update convert-hf-to-gguf.py (iamlemec)
96d37f8  add causal attention gguf key (iamlemec)
e78388d  use ctx_output for tok_norm of BERT and BLOOM (cebtenzzre)
b14c457  bert : add some missing graph callbacks (cebtenzzre)
6875808  fix up model sizing and result acquisition (iamlemec)
d080beb  hard-code token_type = 0 (iamlemec)
3a1895d  Merge branch 'bert' of github.com:iamlemec/llama.cpp into bert (iamlemec)
961e98f  style fixes (iamlemec)
56afb2f  undo attempted type_embd simplify (iamlemec)
ab49e9e  bert : simplify token type embedding access (cebtenzzre)
6972e7e  flake8 : add W503 to ignore list (cebtenzzre)
8fbefed  minor : code style normalization (ggerganov)
e379e8c  avoid use of ggml_graph_get_tensor (iamlemec)
61bab47  Merge branch 'bert' of github.com:iamlemec/llama.cpp into bert (iamlemec)
@@ -1,2 +1,3 @@
 [flake8]
 max-line-length = 125
+ignore = W503
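For context on the flake8 change: W503 is the pycodestyle warning "line break before binary operator". The snippet below is only an illustrative sketch of the layout that `ignore = W503` permits. The function and parameter names are hypothetical and are not taken from this PR's code.

```python
# Illustration only: the layout that the W503 check flags.
# `total_size` and its parameters are hypothetical, not part of the PR.

def total_size(n_vocab: int, n_embd: int, n_layer: int) -> int:
    # Each continuation line begins with the binary operator; W503 warns
    # about exactly this "line break before binary operator" style.
    return (
        n_vocab * n_embd
        + n_layer * n_embd * n_embd
        + n_embd
    )
```

With W503 ignored, flake8 no longer reports a W503 warning for the `+` at the start of the continuation lines.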
Note to self: resolve this before merge
have you... have you forgotten about this...