Skip to content

Commit

Permalink
Fix inference issues FP8-trained models
Browse files Browse the repository at this point in the history
Signed-off-by: Jan Lasek <[email protected]>
  • Loading branch information
janekl committed Oct 1, 2024
1 parent 86408cc commit 56d6e6f
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions nemo/deploy/nlp/megatronllm_deployable.py
Original file line number Diff line number Diff line change
Expand Up @@ -157,6 +157,8 @@ def _load_from_nemo_checkpoint(self, nemo_checkpoint_filepath: str, num_devices:
custom_config.activations_checkpoint_granularity = None
custom_config.activations_checkpoint_method = None
custom_config.dist_ckpt_load_strictness = StrictHandling.LOG_ALL.value
if custom_config.get("fp8", False):
custom_config.fp8 = False

self.model = MegatronGPTModel.restore_from(
nemo_checkpoint_filepath, trainer=trainer, override_config_path=custom_config
Expand Down

0 comments on commit 56d6e6f

Please sign in to comment.