Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review. Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.
dsikka requested changes on Feb 8, 2025
dsikka reviewed on Feb 9, 2025
rahul-tuli approved these changes on Feb 10, 2025
dsikka approved these changes on Feb 11, 2025
Purpose
* `targets` specifies which modules to sparsify, not which layers to target
* `_infer_owl_layer_sparsity` and add test
* `num_logits_to_keep` as Tensor and change it to `logits_to_keep` + add flag huggingface/transformers#35757
Changes
* `targets` and `sequential_targets`
* `_infer_owl_layer_sparsity` and add test
* `calibrate_module` as an abstract method on the sgpt mixin
* `maybe_inject_pos_embeddings` to sequential pipeline to hackily support models with `position_embeddings`
* `on_sequential_batch_end` to call on the end of epoch, rather than every batch
Followups
* `sequential_update` option from examples and tests
Testing
* `tests/llmcompressor/transformers/obcq/test_obcq_owl.py`
Regression Evaluations
Models were compressed using `examples/sparse_2of4_quantization_fp8/llama3_8b_2of4.py` without fp8 option
sparsegpt
Main
This branch
To test wanda, the `SparseGPTModifier` was replaced with the `WandaPruningModifier`
wanda
Main
This branch
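
As a rough illustration of the `targets` versus `sequential_targets` distinction noted under Purpose and Changes, the sketch below groups the modules to sparsify under the layer they would be processed with. It uses only the Python standard library. The module names and the helpers `match_names` and `plan_pruning` are hypothetical and are not llm-compressor's API or implementation.

```python
import re

# Hypothetical module names for a small decoder-only model (illustrative only).
MODULE_NAMES = [
    "model.embed_tokens",
    "model.layers.0",
    "model.layers.0.self_attn.q_proj",
    "model.layers.0.mlp.down_proj",
    "model.layers.1",
    "model.layers.1.self_attn.q_proj",
    "model.layers.1.mlp.down_proj",
    "lm_head",
]


def match_names(names, patterns):
    """Return the names that fully match any of the regex patterns, in order."""
    compiled = [re.compile(p) for p in patterns]
    return [n for n in names if any(c.fullmatch(n) for c in compiled)]


def plan_pruning(names, targets, sequential_targets):
    """Group the modules to sparsify under the layer they belong to.

    sequential_targets: patterns for layers that are processed one at a time.
    targets: patterns for the modules inside those layers that get sparsified.
    """
    plan = {}
    for layer in match_names(names, sequential_targets):
        # Only modules nested under this layer are candidates for it.
        children = [n for n in names if n.startswith(layer + ".")]
        plan[layer] = match_names(children, targets)
    return plan


plan = plan_pruning(
    MODULE_NAMES,
    targets=[r".*\.q_proj"],
    sequential_targets=[r"model\.layers\.\d+"],
)
```

Under these assumptions each decoder layer is still visited in sequence, but only its `q_proj` module is selected for sparsification; `down_proj`, `embed_tokens`, and `lm_head` are left out of the plan.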