
vLLM FP8 quantized support for SFT/GRPO #313

Merged: danielhanchen merged 20 commits into unslothai:main from Datta0:vllm_fp8 on Oct 16, 2025
Conversation

@Datta0 (Collaborator) commented Oct 6, 2025

if extra_in_new:
    for attr in sorted(extra_in_new):
        print(f"EXTRA ATTRIBUTE: {name}.{attr} (exists in new model but not original)")
    print(f'Found some extra attributes like: {list(extra_in_new)[:5]}...')
Contributor commented:

Why?
@Datta0 (Collaborator, Author) replied:
We're copying over the quant method in some places; that is the difference between the expected and the created model. We didn't want to spam the console with every layer printing the same message:

Extra attribute quant method in model.model.layers.x
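The rationale above (print a short sample instead of one line per layer) can be sketched as follows; the helper name `report_extra_attributes` and its inputs are hypothetical illustrations, not code from this PR:

```python
# Hypothetical sketch: compare attribute names between an original module and
# a re-created (quantized) one, and summarize the extras compactly rather than
# emitting one console line per layer.

def report_extra_attributes(name, orig_attrs, new_attrs, sample_size=5):
    """Return attributes present only on the new module; print a short summary."""
    extra_in_new = set(new_attrs) - set(orig_attrs)
    if extra_in_new:
        sample = sorted(extra_in_new)[:sample_size]
        print(f"{name}: {len(extra_in_new)} extra attribute(s), e.g. {sample}")
    return extra_in_new

# Illustrative usage: the re-created layer carries an extra quant attribute.
extra = report_extra_attributes(
    "model.model.layers.0",
    orig_attrs=["weight", "bias"],
    new_attrs=["weight", "bias", "quant_method"],
)
```

Capping the sample keeps the diagnostic visible without repeating an identical line for every transformer layer.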

Comment threads (outdated): unsloth_zoo/empty_model.py (3), unsloth_zoo/vllm_utils.py (7)
@danielhanchen (Contributor) left a review comment:

Nice

Comment threads (outdated): unsloth_zoo/vllm_utils.py (4)
@Datta0 changed the title from "vLLM FP8-E4M3 block quantized support" to "vLLM FP8 quantized support for SFT/GRPO" on Oct 15, 2025
Comment thread (outdated): unsloth_zoo/empty_model.py
@danielhanchen (Contributor) left a review comment:

Small changes

@danielhanchen merged commit 4613671 into unslothai:main on Oct 16, 2025

2 participants