convert.py : add rope_freq_base when converting CodeLlama from an HF model #2773

slaren · 2023-08-24T20:17:25Z

Note that this parameter is still not officially supported by HF, and the name may change in the future.

@TheBloke Let me know if this works for you.

TheBloke · 2023-08-24T23:26:50Z

Thanks very much!

And it looks like rope_theta is now official. HF PR for LlamaCode has come out: https://github.com/huggingface/transformers/pull/25740/files

* master: (773 commits) server : add `/detokenize` endpoint (ggerganov#2802) convert.py : advanced option (ggerganov#2753) llama : use Unicode Escape Sequence to replace encoded characters (ggerganov#2814) flake.nix : add rocm support and cleanup (ggerganov#2808) llama : move #includes out of _GNU_SOURCE conditional (ggerganov#2817) main : fix bug (penalize_nl=false doesn't work) + suppress warning on mingw (ggerganov#1528) llama : use std::abs in llama_sample_tail_free (ggerganov#2800) k-quants : remove unnecessary tensor shape restrictions (ggerganov#2811) Better perplexity for 2- and 3-bit quantization for LLaMA-v2-70B (ggerganov#2807) Fix HellaSwag (ggerganov#2805) flake : build llama.cpp on Intel with nix (ggerganov#2795) Handle null rope scaling value (ggerganov#2793) Fix spm whitespaces (ggerganov#2806) examples : skip unnecessary external lib in server README.md how-to (ggerganov#2804) llama : fix struct decl (ggerganov#2790) Faster perplexity computation (ggerganov#2786) llama : add llama_beam_search() (ggerganov#2267) convert.py : Get rope scale from HuggingFace models (ggerganov#2772) llama-bench : add model sizes (ggerganov#2771) convert.py : export rope freq_base when converting CodeLlama from an HF model (ggerganov#2773) ...

…HF model (ggerganov#2773)

convert.py : add freq_base when converting CodeLlama from an HF model

06f7925

slaren marked this pull request as ready for review August 24, 2023 23:50

ggerganov approved these changes Aug 25, 2023

View reviewed changes

slaren merged commit 12e2e33 into master Aug 25, 2023
3 checks passed

slaren deleted the codellama-hf-freq-base branch August 25, 2023 12:08

akawrykow pushed a commit to akawrykow/llama.cpp that referenced this pull request Aug 29, 2023

convert.py : export rope freq_base when converting CodeLlama from an …

d4835b8

…HF model (ggerganov#2773)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

convert.py : add rope_freq_base when converting CodeLlama from an HF model #2773

convert.py : add rope_freq_base when converting CodeLlama from an HF model #2773

slaren commented Aug 24, 2023

TheBloke commented Aug 24, 2023

convert.py : add rope_freq_base when converting CodeLlama from an HF model #2773

convert.py : add rope_freq_base when converting CodeLlama from an HF model #2773

Conversation

slaren commented Aug 24, 2023

TheBloke commented Aug 24, 2023