Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support llama3 tokenizer #67

Merged
merged 9 commits into from
May 1, 2024
Merged

Conversation

bhavya01
Copy link
Collaborator

@bhavya01 bhavya01 commented May 1, 2024

No description provided.

@bhavya01 bhavya01 marked this pull request as ready for review May 1, 2024 17:35
@bhavya01 bhavya01 requested a review from vipannalla as a code owner May 1, 2024 17:35
Copy link
Collaborator

@vipannalla vipannalla left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good.

@@ -0,0 +1,235 @@
# Copyright (c) Meta Platforms, Inc. and affiliates.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you move this file to third_part/llama2? It would be better to separate google code vs third party code.

jetstream/engine/token_utils.py Show resolved Hide resolved
@bhavya01 bhavya01 requested a review from FanhaiLu1 May 1, 2024 18:46
jetstream/engine/token_utils.py Show resolved Hide resolved
@bhavya01 bhavya01 merged commit 2db6c14 into AI-Hypercomputer:main May 1, 2024
3 checks passed
@bhavya01 bhavya01 deleted the llama3 branch May 1, 2024 20:45
jwyang-google pushed a commit that referenced this pull request May 6, 2024
* Support llama3 tokenizer

* Add tiktoken to requirements

* Add blobfile to requirements

* Fix unit tests

* Fix linting issues

* Fix pytype errors

* Move llama3 tokenizer to third_party directory

* Fix pytype error

* Update pytype command
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants