This repository has been archived by the owner on Dec 16, 2022. It is now read-only.

Add way of skipping pretrained weights download #5172

Merged
merged 3 commits into main from transformer-no-load-weights on May 2, 2021

Conversation

@epwalsh (Member) commented on Apr 30, 2021

Fixes #4599.

Changes proposed in this pull request:

  • Adds a load_weights: bool (default = True) parameter to cached_transformers.get() and to all higher-level modules that call it, such as PretrainedTransformerEmbedder and PretrainedTransformerMismatchedEmbedder. Setting this parameter to False skips downloading and loading the pretrained transformer weights, so only the architecture is instantiated (see the sketch below). In particular, you can set it to False via the overrides parameter when loading an AllenNLP model/predictor from an archive to avoid an unnecessary download.
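
For instance, here is a minimal sketch of using the new parameter directly from Python. The import path and positional model_name argument follow the usual AllenNLP conventions and are not spelled out in this PR, so treat it as illustrative:

from allennlp.modules.token_embedders import PretrainedTransformerEmbedder

# Only the architecture is instantiated from the model config; the pretrained
# weights themselves are neither downloaded nor loaded.
embedder = PretrainedTransformerEmbedder("bert-base-cased", load_weights=False)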

For example, suppose your training config looks something like this:

{
  "model": {
    "type": "basic_classifier",
    "text_field_embedder": {
      "tokens": {
        "type": "pretrained_transformer",
        "model_name": "bert-base-cased",
        // ... other stuff ...
      }
    },
  },
  // ... other stuff ...
}

Now suppose you have an archive from training this model, model.tar.gz. You can load the trained model into a predictor without re-downloading the pretrained transformer weights like so:

from allennlp.predictors import Predictor

overrides = {"model.text_field_embedder.tokens.load_weights": False}
predictor = Predictor.from_path("model.tar.gz", overrides=overrides)
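
Note that the fine-tuned weights saved in model.tar.gz are still loaded from the archive as usual; setting load_weights to False only skips downloading and loading the original pretrained weights, which would be overwritten by the archived state anyway.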

@epwalsh (Member, Author) commented on Apr 30, 2021

Unfortunately this actually doesn't address #5170, because the SrlBert model uses the transformers library directly. But that's not hard to fix. I'll follow up with a separate PR for that in allennlp-models.

@ArjunSubramonian (Contributor) left a comment

Awesome! Looks great to me. Thanks for meticulously going down the stack :)

@epwalsh merged commit a463e0e into main on May 2, 2021
@epwalsh deleted the transformer-no-load-weights branch on May 2, 2021 at 21:51
dirkgr pushed a commit that referenced this pull request on May 10, 2021
* add way of skipping pretrained weights download

* clarify docstring

* add link to PR in CHANGELOG