@SolenoidWGT SolenoidWGT commented Feb 3, 2024

This PR fixes two problems with adapting InternLM2 to llama.cpp:

  1. The q and k weights require an additional reshape to be compatible with llama's inference interface.
  2. For the chat model, we need to explicitly replace llama's eos with InternLM2's eos, so that the model can end the conversation normally.
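A minimal sketch of the two fixes, in the spirit of llama.cpp's Python convert scripts. The function names and the token ids below are illustrative assumptions, not the PR's actual code:

```python
import numpy as np

def permute_qk(w: np.ndarray, n_head: int) -> np.ndarray:
    # Reorder q/k weight rows so the two rotary-embedding halves of each
    # head are laid out the way llama.cpp's inference code expects
    # (modeled on the permute helper used by llama.cpp's convert script).
    return (w.reshape(n_head, 2, w.shape[0] // n_head // 2, *w.shape[1:])
             .swapaxes(1, 2)
             .reshape(w.shape))

# For the chat model, generation must stop on InternLM2's end-of-turn
# token ([UNUSED_TOKEN_145]) rather than llama's default </s>.
LLAMA_EOS_ID = 2          # assumption: llama's default eos id
INTERNLM2_EOS_ID = 92542  # assumption: id of [UNUSED_TOKEN_145]

def remap_eos(token_id: int) -> int:
    # Swap llama's eos id for InternLM2's so the chat ends normally.
    return INTERNLM2_EOS_ID if token_id == LLAMA_EOS_ID else token_id
```

Note that `permute_qk` only reorders whole rows, so the weight values themselves are untouched; the reshape/swapaxes pair just regroups the rotary halves per attention head.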

The prompt format, for reference:

[UNUSED_TOKEN_146]system\nYou are InternLM (书生·浦语), a helpful, honest, and harmless AI assistant developed by Shanghai AI Laboratory (上海人工智能实验室).[UNUSED_TOKEN_145]\n

User name

[UNUSED_TOKEN_146]user

Bot name

[UNUSED_TOKEN_146]assistant

Prompt template

{{prompt}}

{{history}} 
{{char}}:

Chat history template

{{name}}: 
{{message}} [UNUSED_TOKEN_145]
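Putting the template pieces above together, a hypothetical helper that assembles a full prompt in this format (the function name and signature are illustrative, not part of the PR):

```python
def build_internlm2_prompt(system: str, history: list[tuple[str, str]]) -> str:
    # [UNUSED_TOKEN_146] opens a role block and [UNUSED_TOKEN_145] closes
    # it, matching the template shown above.
    parts = [f"[UNUSED_TOKEN_146]system\n{system}[UNUSED_TOKEN_145]\n"]
    for role, message in history:  # role is "user" or "assistant"
        parts.append(f"[UNUSED_TOKEN_146]{role}\n{message}[UNUSED_TOKEN_145]\n")
    parts.append("[UNUSED_TOKEN_146]assistant\n")  # cue the model to reply
    return "".join(parts)
```

The trailing open assistant block is what prompts the model to generate its turn; the model then emits `[UNUSED_TOKEN_145]` (its eos) to end it, which is why the eos replacement in this PR matters.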


cc: @arch-btw @sweetcard

@SolenoidWGT force-pushed the fix/internlm2_qk_shape branch from bf507c8 to 54dd7da on February 3, 2024 18:31

arch-btw commented Feb 4, 2024

Thank you! I can confirm that it works with internlm2-chat-1_8b-sft.


@sweetcard left a comment


It can work now. Thank you. 👍

@ggerganov ggerganov merged commit 7e1ae37 into ggml-org:master Feb 5, 2024
jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Mar 13, 2024
* py : fix internlm2-hf convert to gguf

* ggml-ci
hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024
* py : fix internlm2-hf convert to gguf

* ggml-ci

4 participants