
[Model][Hardware][NV] Add support for ModelOpt static scaling checkpoints #5387

Closed

pavanimajety wants to merge 1 commit into vllm-project:main from pavanimajety:read_ammo_chkpoint

Conversation

@pavanimajety (Collaborator)

This change adds support for running ModelOpt FP8 checkpoints. In FP8 quantization mode, it converts key names from the ModelOpt checkpoint format to the key names vLLM recognizes.
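For illustration, a minimal sketch of that conversion, assuming the checkpoint is already loaded as an in-memory state dict. The helper names and structure here are illustrative, not the exact diff; only the replacement table appears in this PR:

from typing import Dict

import torch

# Replacement table from the PR: ModelOpt stores static scales as
# *_quantizer._amax tensors; vLLM expects weight_scale / act_scale names.
REPLACEMENTS = {
    "weight_quantizer._amax": "weight_scale",
    "input_quantizer._amax": "act_scale",
}

def convert_modelopt_key(key: str) -> str:
    # Map a ModelOpt "*_quantizer._amax" key to its vLLM scale name,
    # leaving all other keys untouched.
    for modelopt_suffix, vllm_suffix in REPLACEMENTS.items():
        if key.endswith(modelopt_suffix):
            return key[:-len(modelopt_suffix)] + vllm_suffix
    return key

def convert_modelopt_state_dict(
        state_dict: Dict[str, torch.Tensor]) -> Dict[str, torch.Tensor]:
    # Rebuild the state dict under vLLM-recognized key names.
    return {convert_modelopt_key(k): v for k, v in state_dict.items()}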

"""Replaces the names of *quantizer._amax to _scale."""
replacements = {
"weight_quantizer._amax": "weight_scale",
"input_quantizer._amax": "act_scale",
Collaborator

FYI, we just tweaked this to input_scale ahead of the v0.5.0 beta launch.

@robertgshaw2-redhat (Collaborator)

robertgshaw2-redhat commented Jun 10, 2024

Does this have to be implemented in llama.py, or could this logic be generic to all models and implemented in our existing Fp8LinearMethod?

"""Replaces the names of *quantizer._amax to _scale."""
replacements = {
"weight_quantizer._amax": "weight_scale",
"input_quantizer._amax": "act_scale",
Member

This has been updated such that act_scale -> input_scale in #5353.
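For reference, a sketch of the table after that rename (not the exact diff):

replacements = {
    "weight_quantizer._amax": "weight_scale",
    "input_quantizer._amax": "input_scale",  # renamed from act_scale in #5353
}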

weights_to_convert = []
vllm_state_dict = {}
for key, value in input_state_dict.items():
    if key.endswith("_amax"):
Member

It would be best to make this as specific as possible to avoid conflicts -- would if key.endswith("_quantizer._amax"): work?
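That is, a sketch of the tighter check (the loop body here is assumed, not from the diff):

for key, value in input_state_dict.items():
    # Match only quantizer amax tensors, not any key that merely ends
    # in "_amax".
    if key.endswith("_quantizer._amax"):
        weights_to_convert.append(key)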

else:
    return key, value

def _convert_ammo_weights(self, input_state_dict: Dict[str, torch.Tensor]):

ammo is not the product name. Let's use modelopt instead.
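For example, a sketch of the rename (signature only; the body is elided):

def _convert_modelopt_weights(self, input_state_dict: Dict[str, torch.Tensor]):
    # Convert ModelOpt checkpoint keys and scales to vLLM format.
    ...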
