change quant conf example to use act_maxabs_pow2_weights_pcs_opt_pow2 #763
HolyFalafel wants to merge 2 commits into
@HolyFalafel What's the difference between the two methods?
The method we switched to gave us better accuracy.
Thanks for the explanation! Your branch is quite far behind the head of the main branch (it still relies on Transformers v4.34). Could you rebase it so that I can test it, please?
I wanted to minimize the number of commits. I'll rebase.
Just to make sure I test it correctly, I first need to measure with and then run the quantized model with right?
The second command returns this error: Is it supposed to work with v1.14 or should I wait for v1.15?
Yes, it's a change made in v1.15, so let's wait for it.
This looks correct. @bgoldberg-habana any remarks?
@regisss Please note that we measure with run_lm_eval.py to capture more data. Measurement: Per Tensor Quantization (PTQ): Per Channel Quantization (PCQ): You can also run on 7B.
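As a rough illustration of the per-tensor versus per-channel distinction mentioned above, here is a minimal sketch of max-abs scaling with power-of-two scales. It is not the quantization toolkit's implementation: the function names, the `FMT_MAX` value of 240.0, and the round-up-to-a-power-of-two rule are all assumptions made for illustration, loosely suggested by the `maxabs` and `pow2` parts of the method name.

```python
import math

# Assumed largest magnitude the target 8-bit float format can represent.
# This value is illustrative and hardware-dependent.
FMT_MAX = 240.0


def pow2_scale(max_abs, fmt_max=FMT_MAX):
    """Return the smallest power-of-two scale s with max_abs / s <= fmt_max."""
    if max_abs == 0.0:
        return 1.0  # nothing to scale; avoid log2(0)
    return 2.0 ** math.ceil(math.log2(max_abs / fmt_max))


def per_tensor_scale(weights):
    """One scale for the whole 2-D weight matrix (list of rows)."""
    return pow2_scale(max(abs(w) for row in weights for w in row))


def per_channel_scales(weights):
    """One scale per output channel, here taken to be one per row."""
    return [pow2_scale(max(abs(w) for w in row)) for row in weights]
```

With a single large outlier in one row, the per-tensor scale is dictated by that outlier, while the per-channel variant lets the other rows keep a finer scale. That is one plausible reason a per-channel method could give better accuracy, though the thread itself does not state the cause.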
regisss left a comment
LGTM!
Waiting for the release of Synapse v1.15 to merge.
@HolyFalafel I guess we can close this PR, since this change was also included in #765?
Right.