[Usage]: Loading a model with bitsandbytes quantization with 8bit #8720

IkhlasAlhussien · 2024-09-23T00:10:41Z

How can I load a model using bitsandbytes quantization in 8-bit format? I'm currently loading the model with the following code:

model_id = "path/to/model"
llm = LLM(model=model_id, dtype=torch.bfloat16, trust_remote_code=True, \
quantization="bitsandbytes", load_format="bitsandbytes")

This loads the model in 4-bit format, but I can't figure out how to load it in 8-bit. What should I change to load the model in 8-bit instead?

github-actions · 2024-12-22T02:03:38Z

This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!

github-actions · 2025-01-22T01:59:58Z

This issue has been automatically closed due to inactivity. Please feel free to reopen if you feel it is still relevant. Thank you!

IkhlasAlhussien added the usage How to use vllm label Sep 23, 2024

IkhlasAlhussien changed the title ~~[Usage]: Loading a model with BitsandBytes quantization with 8bit~~ [Usage]: Loading a model with bitsandbytes quantization with 8bit Sep 23, 2024

molereddy mentioned this issue Nov 27, 2024

[Bug]: Loading a model with bitsandbytes 8bit quantization #8799

Open

1 task

github-actions bot added the stale label Dec 22, 2024

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Jan 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Usage]: Loading a model with bitsandbytes quantization with 8bit #8720

[Usage]: Loading a model with bitsandbytes quantization with 8bit #8720

IkhlasAlhussien commented Sep 23, 2024 •

edited

Loading

github-actions bot commented Dec 22, 2024

github-actions bot commented Jan 22, 2025

[Usage]: Loading a model with bitsandbytes quantization with 8bit #8720

[Usage]: Loading a model with bitsandbytes quantization with 8bit #8720

Comments

IkhlasAlhussien commented Sep 23, 2024 • edited Loading

github-actions bot commented Dec 22, 2024

github-actions bot commented Jan 22, 2025

IkhlasAlhussien commented Sep 23, 2024 •

edited

Loading