Pad the examples for QLoRA finetuning test #1941
* [SW-226132] Pad the examples
* update test name

---------

Co-authored-by: Vivek Goel <vgoel@habana.ai>
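The padding code itself is not shown in this conversation, so the following is only a minimal sketch of what padding every example to one fixed length could look like. The names `pad_examples`, `max_length`, and `pad_token_id` are hypothetical and are not taken from the PR; the likely motivation, keeping tensor shapes constant across batches, is also an assumption.

```python
from typing import Dict, List


def pad_examples(
    examples: List[List[int]],
    max_length: int,
    pad_token_id: int = 0,  # hypothetical default; real tokenizers define their own pad id
) -> Dict[str, List[List[int]]]:
    """Right-pad (or truncate) each token-id list to exactly max_length."""
    input_ids: List[List[int]] = []
    attention_mask: List[List[int]] = []
    for ids in examples:
        ids = ids[:max_length]            # truncate examples that are too long
        n_pad = max_length - len(ids)     # number of pad positions still needed
        input_ids.append(ids + [pad_token_id] * n_pad)
        # 1 marks a real token, 0 marks padding
        attention_mask.append([1] * len(ids) + [0] * n_pad)
    return {"input_ids": input_ids, "attention_mask": attention_mask}


batch = pad_examples([[5, 6, 7], [8, 9]], max_length=4)
```

With every row the same length, each batch has an identical shape regardless of the raw example lengths.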
|
LGTM. @regisss, can we add these tests to the slow_tests you run for G2/G3? This will help us avoid regressions. We can publish QLoRA support as an experimental/beta feature on G2/G3 (it works for limited configurations and performance is not great). Note that this PR depends on the Synapse 1.21.0 release (not backward compatible).
|
Support for NF4 quantization/dequantization using Intel Gaudi hardware: bitsandbytes-foundation/bitsandbytes#1592 |
|
We're starting to see We've run |
|
... ... appears to cause the error to show up during test run. |
|
Possibly related to old peft version in |
|
This error is coming from bitsandbytes, it should not be related to peft. I've run other tests locally (test_bnb_qlora.py, test_bnb_inference.py), and I didn't face this issue. Let me check it again. |
|
In case of HPU, bitsandbytes loads the CPU binaries (https://github.com/bitsandbytes-foundation/bitsandbytes/blob/multi-backend-refactor/bitsandbytes/cextension.py#L73), which are not required. In short,
|
|
Fix for the above issue: |
Additional updates
6. Enable eager mode for the test (torch.compile mode is disabled for now).
7. Add a new requirement to install bitsandbytes (from https://github.com/bitsandbytes-foundation/bitsandbytes/tree/multi-backend-refactor)