
Conversation

@Stillerman (Contributor) commented Jul 20, 2024

Adds pre-tokenizer support for the SmolLM models.

Added a tokenizer type for SmolLM-135M in convert-hf-to-gguf-update.py
Added the chkhsh for SmolLM-135M in convert-hf-to-gguf.py
Added the LLAMA_VOCAB_PRE_TYPE_SMOLLM enum to llama.h

Ran ./tests/test-tokenizer-0.sh smollm ./models/ggml-vocab-smollm.gguf and the tests passed.
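
Roughly, the Python side boils down to computing the tokenizer hash and mapping it to the "smollm" pre-tokenizer name. A minimal sketch of that mechanism, assuming the tokenizer repo is HuggingFaceTB/SmolLM-135M; the chktxt string is abbreviated here (the real one is the long shared test string defined in convert-hf-to-gguf-update.py), so the printed digest is only a placeholder:

```python
# Sketch: compute a chkhsh the same way convert-hf-to-gguf-update.py does after a
# model is registered in its `models` list. Assumptions: repo name and abbreviated
# test string; the actual digest comes from running the real script.
from hashlib import sha256
from transformers import AutoTokenizer

chktxt = "..."  # placeholder for the shared test string from convert-hf-to-gguf-update.py

tokenizer = AutoTokenizer.from_pretrained("HuggingFaceTB/SmolLM-135M")
chktok = tokenizer.encode(chktxt)
chkhsh = sha256(str(chktok).encode()).hexdigest()
print(chkhsh)

# The printed digest is what get_vocab_base_pre() in convert-hf-to-gguf.py matches
# to pick the pre-tokenizer, along the lines of:
#
#   if chkhsh == "<digest printed above>":
#       # ref: https://huggingface.co/HuggingFaceTB/SmolLM-135M
#       res = "smollm"
```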

Thank you @m18coppola for #8579

@loubnabnl @anton-l Does src/llama.cpp look right? Are any special settings needed in the tokenizer? The .gguf I created for SmolLM-135M seemed to run inference well.

github-actions bot added the python (python script changes) label on Jul 20, 2024
Stillerman and others added 2 commits July 21, 2024 02:48
@ngxson linked an issue on Jul 21, 2024 that may be closed by this pull request
@Vaibhavs10 (Collaborator) left a comment


Tested this! Seems to work well for me too!

@eliebak commented Jul 22, 2024

Seems to work well for me too @Stillerman, thanks a lot for adding support to our model! 🤗

@spergware commented

The 1.7B model prints out nonsense. Is it not supported? Asking for clarification.

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jul 27, 2024
* Adding SmolLM Pre Tokenizer

* Update convert_hf_to_gguf_update.py

Co-authored-by: compilade <[email protected]>

* Update src/llama.cpp

Co-authored-by: compilade <[email protected]>

* handle regex

* removed .inp and .out ggufs

---------

Co-authored-by: compilade <[email protected]>

Labels: python (python script changes)

Projects: None yet

Development: Successfully merging this pull request may close this issue: Support for SmolLM

6 participants