@ggerganov commented Mar 21, 2023

  • Add test-tokenizer-0, which runs a few sample tokenizations (feel free to expand it)
  • Add an option to the convert-pth-to-ggml.py script to dump just the vocabulary
  • Add ./models/ggml-vocab.bin, which contains only the LLaMA vocab data (used for tests)
  • Add a utility to load the vocabulary file from the previous point (temporary implementation)
  • Revert the std::string_view changes and drop back to C++11
  • Rename gpt_vocab -> llama_vocab
  • All CMake binaries now go into ./bin/

Need help resolving the Windows CI issues so this can be merged

@ggerganov closed this Mar 21, 2023
Deadsg pushed a commit to Deadsg/llama.cpp that referenced this pull request Dec 19, 2023
…t-build-core"

This reverts commit fb2c5f7, reversing
changes made to 202ed44.