questions for loading the pretrained_model #21

mingbocui · 2019-11-12T11:05:13Z

    def load(self, model_file, pretrain_file):
        """ load saved model or pretrained transformer (a part of model) """
        if model_file:
            print('Loading the model from', model_file)
            self.model.load_state_dict(torch.load(model_file))

        elif pretrain_file: # use pretrained transformer
            print('Loading the pretrained model from', pretrain_file)
            if pretrain_file.endswith('.ckpt'): # checkpoint file in tensorflow
                checkpoint.load_model(self.model.transformer, pretrain_file)
            elif pretrain_file.endswith('.pt'): # pretrain model file in pytorch
                self.model.transformer.load_state_dict(
                    {key[12:]: value
                        for key, value in torch.load(pretrain_file).items()
                        if key.startswith('transformer')}
                ) # load only transformer parts

Could I kindly ask that what is the meaning of key[12:]: value when you load a pretrained_model? Just want to keep the last layer? Thanks, hope for your reply.

The text was updated successfully, but these errors were encountered:

dhlee347 · 2019-11-14T07:12:47Z

It is because I wanted to load only a transformer part of saved model, not the whole model.

mingbocui · 2019-11-29T18:41:05Z

@dhlee347 thanks for your reply. I have one more question, if I change the number of BERT layers from 12 to 6, should I change the key[12:] to key[6:]?

mingbocui closed this as completed Nov 18, 2019

mingbocui reopened this Nov 29, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

questions for loading the pretrained_model #21

questions for loading the pretrained_model #21

mingbocui commented Nov 12, 2019

dhlee347 commented Nov 14, 2019

mingbocui commented Nov 29, 2019

questions for loading the pretrained_model #21

questions for loading the pretrained_model #21

Comments

mingbocui commented Nov 12, 2019

dhlee347 commented Nov 14, 2019

mingbocui commented Nov 29, 2019