
convert : remove input_scale for dequantized fp8 modelopt #22356

Merged

CISC merged 3 commits into master from cisc/convert-modelopt-input-scale-fp8 on Apr 27, 2026

Conversation

@CISC (Member) commented on Apr 25, 2026

Overview

Fixes #22346

Additional information

Refactors scale tensor writing into reusable methods and removes input_scale for dequantized FP8 modelopt tensors.
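For context, here is a minimal sketch of the idea (hypothetical helper names, not the actual convert_hf_to_gguf.py code): once FP8 modelopt weights are dequantized during conversion, weight_scale is folded into the weights and input_scale is no longer meaningful, so neither scale tensor should be written to the output model.

```python
# Hypothetical sketch, not the convert_hf_to_gguf.py implementation:
# drop the modelopt scale tensors once the FP8 weights are dequantized.
import torch


def dequant_fp8(weight: torch.Tensor, weight_scale: torch.Tensor) -> torch.Tensor:
    """Fold the per-tensor weight_scale back into the FP8 weight."""
    return weight.to(torch.float32) * weight_scale.to(torch.float32)


def dequantize_modelopt_tensors(tensors: dict[str, torch.Tensor]) -> dict[str, torch.Tensor]:
    """Return only the tensors that should be written after dequantization.

    weight_scale is consumed by dequant_fp8, and input_scale (an
    activation scale used by runtime FP8 kernels) is dropped entirely.
    """
    out: dict[str, torch.Tensor] = {}
    for name, tensor in tensors.items():
        if name.endswith((".weight_scale", ".input_scale")):
            continue  # scale tensors are not written for dequantized weights
        scale = tensors.get(name + "_scale")
        if name.endswith(".weight") and scale is not None:
            tensor = dequant_fp8(tensor, scale)
        out[name] = tensor
    return out
```

The actual change refactors the scale-tensor writing into reusable converter methods; the sketch only illustrates which tensors end up being written.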


The github-actions bot added the python label (python script changes) on Apr 25, 2026.
CISC changed the title from "convert : support input_scale for fp8 modelopt" to "convert : remove input_scale for dequantized fp8 modelopt" on Apr 26, 2026.
@danbev (Member) left a comment

Verified locally that the model conversion works 👍

CISC merged commit d13540b into master on Apr 27, 2026 (9 checks passed).
CISC deleted the cisc/convert-modelopt-input-scale-fp8 branch on April 27, 2026 at 06:45.
IntelNav pushed a commit to IntelNav/llama.cpp that referenced this pull request Apr 29, 2026
rsenthilkumar6 pushed a commit to rsenthilkumar6/llama.cpp that referenced this pull request May 1, 2026
samuraieng pushed a commit to samuraieng/llama.cpp that referenced this pull request May 6, 2026
ljubomirj pushed a commit to ljubomirj/llama.cpp that referenced this pull request May 6, 2026
meh pushed a commit to meh/llama.cpp that referenced this pull request May 10, 2026

Labels

python (python script changes)

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Misc. bug: convert_hf_to_gguf.py no longer supports Nemotron 3 super FP8 or NVFP4

3 participants