Refactoring `convert-pth-to-ggml.py`: more concise and readable by qunash · Pull Request #109 · ggml-org/llama.cpp

qunash · 2023-03-14T00:59:06Z

No description provided.

SuajCarrot · 2023-03-16T16:05:59Z

Exactly what I was thinking. However, I think a better approach to building paths is to use `os.path.join` instead of string concatenation, simply to avoid typos by either the user or the programmer if the code changes in the future. Overall, LGTM.
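
To illustrate the suggestion, here is a minimal sketch. The helper name `part_path` and the `consolidated.NN.pth` file-name pattern are assumptions made for illustration, not code taken from this PR.

```python
import os


def part_path(dir_model: str, part: int) -> str:
    """Return the checkpoint path for one model part (hypothetical helper)."""
    # Assumed file-name pattern; adjust to the real checkpoint names.
    fname = f"consolidated.{part:02d}.pth"
    # os.path.join adds a separator only when one is missing, so a
    # trailing slash on dir_model does not produce a doubled separator.
    return os.path.join(dir_model, fname)


if __name__ == "__main__":
    print(part_path("models/7B/", 0))
    print(part_path("models/7B", 1))
```

With plain concatenation, a directory argument such as `models/7B/` followed by `"/" + fname` would yield a doubled separator; joining the components avoids having to track whether the caller supplied the trailing slash.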

ggerganov · 2023-03-19T17:22:39Z

@SuajCarrot

I get this error:

python3 convert-pth-to-ggml.py models/7B/ 1
{'dim': 4096, 'multiple_of': 256, 'n_heads': 32, 'n_layers': 32, 'norm_eps': 1e-06, 'vocab_size': -1}
n_parts = 1

Processing part 0

Processing variable: tok_embeddings.weight with shape: torch.Size([32000, 4096]) and type: torch.float16

Traceback (most recent call last):
  File "/Users/ggerganov/development/github/llama.cpp/convert-pth-to-ggml.py", line 157, in <module>
    main()
  File "/Users/ggerganov/development/github/llama.cpp/convert-pth-to-ggml.py", line 151, in main
    process_and_write_variables(fout, model, ftype)
  File "/Users/ggerganov/development/github/llama.cpp/convert-pth-to-ggml.py", line 127, in process_and_write_variables
    data.tofile(fout)
AttributeError: 'Tensor' object has no attribute 'tofile'. Did you mean: 'tile'?

Any ideas?

Edit: fixed
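
The traceback above arises because `torch.Tensor` has no `tofile` method, whereas `numpy.ndarray` does. The comment does not say what the fix was; one possible approach, sketched here under that assumption, is to convert the tensor to a NumPy array before writing. The helper name `write_tensor` is hypothetical, and the check is duck-typed so the sketch itself imports neither library.

```python
def write_tensor(fout, data):
    """Write raw tensor data to an open binary file (hypothetical helper)."""
    # torch.Tensor has no tofile(), but it does have numpy(), which returns
    # a numpy.ndarray that supports tofile().
    if not hasattr(data, "tofile") and hasattr(data, "numpy"):
        data = data.numpy()
    data.tofile(fout)
```

With PyTorch installed, `write_tensor(fout, tensor)` would behave like `tensor.numpy().tofile(fout)` for a CPU tensor. A tensor that lives on a GPU or requires gradients would first need `.detach().cpu()`.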

…-org#109)

* Refactor get_n_parts function to simplify code and improve readability
* Use f-strings instead of concatenation
* Refactoring: more concise and readable
* modularize

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* iq1_bn: improve CUDA TG

  On RTX-3080 TG-128(Bitnet-1.58b-3B) goes from 318 t/s to 340 t/s. I see I have on the front page 301 t/s, so pretty nice improvement since then.

* iq2_bn(CUDA): quants are not 4-byte aligned

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>

qunash added 4 commits March 14, 2023 01:50

Refactor get_n_parts function to simplify code and improve readability

94f368f

Use f-strings instead of concatenation

d8aba05

Refactoring: more concise and readable

c2af311

modularize

e1b1e12

qunash changed the title ~~Refactoring: more concise and readable~~ Refactoring convert-pth-to-ggml.py: more concise and readable Mar 14, 2023

Merge branch 'master' into master

c2577fd

sw mentioned this pull request Mar 18, 2023

improvement(tools): optimize convert-pth-to-ggml #232

Closed

gjmulder added the `duplicate` label ("This issue or pull request already exists") Mar 18, 2023

ggerganov approved these changes Mar 19, 2023

Merge branch 'master' into master

6535332

ggerganov merged commit 467b149 into ggml-org:master Mar 19, 2023

ggerganov added a commit that referenced this pull request Mar 19, 2023

Fix python stuff (#109)

c1c7026

Bearsaerker mentioned this pull request Mar 12, 2025

Eval bug: Gemma 3 extremly slow prompt processing when using quantized kv cache. #12352

Closed

Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026

Fix python stuff (ggml-org#109)

c03de4d

ljubomirj pushed a commit to ljubomirj/llama.cpp that referenced this pull request May 6, 2026

Fix python stuff (ggml-org#109)

44e7e22

ggerganov merged 6 commits into ggml-org:master from qunash:master
