
Conversation

@ArthurZucker
Collaborator

@ArthurZucker ArthurZucker commented Dec 1, 2022

What does this PR do?

Allows loading sharded checkpoints in TF models. Should fix #19965

  • from_pt=True
  • from_flax=True

cc @sgugger just FYI

@ArthurZucker
Collaborator Author

ArthurZucker commented Dec 1, 2022

Works great for sharded PyTorch checkpoints since a utility was already implemented. Though we are not going to push for Flax, it would still help to have the support already!

from transformers import TFT5ForConditionalGeneration
MODEL_NAME = "google/flan-t5-xl"
m = TFT5ForConditionalGeneration.from_pretrained(MODEL_NAME, from_pt=True)
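
For context, a sharded PyTorch checkpoint ships an index JSON that maps each weight name to the shard file containing it. The sketch below shows one way such a checkpoint could be merged back into a single state dict before the PyTorch-to-TF conversion; the file names follow the usual Hugging Face sharding convention, and this is only an illustration, not the code added by this PR:

import json
import os

import torch


def load_sharded_pytorch_state_dict(checkpoint_dir):
    """Merge all shards referenced by the index file into one state dict (sketch)."""
    index_path = os.path.join(checkpoint_dir, "pytorch_model.bin.index.json")
    with open(index_path) as f:
        index = json.load(f)

    # "weight_map" maps each parameter name to the shard file that contains it.
    shard_files = sorted(set(index["weight_map"].values()))

    state_dict = {}
    for shard_file in shard_files:
        shard = torch.load(os.path.join(checkpoint_dir, shard_file), map_location="cpu")
        state_dict.update(shard)
    return state_dict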

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Dec 1, 2022

The documentation is not available anymore as the PR was closed or merged.

@ArthurZucker ArthurZucker marked this pull request as ready for review December 2, 2022 15:31
@ArthurZucker
Collaborator Author

Just need to remove the # TODOs

@ArthurZucker ArthurZucker requested a review from sgugger December 2, 2022 17:10
Comment on lines 2573 to 2578
elif os.path.isfile(pretrained_model_name_or_path):
archive_file = pretrained_model_name_or_path
is_local = True
elif os.path.isfile(pretrained_model_name_or_path + ".index"):
archive_file = pretrained_model_name_or_path + ".index"
is_local = True
Collaborator

This code shouldn't be removed, to preserve compatibility with PreTrainedModel.from_pretrained(path_to_a_model_path, config=config)
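
For illustration, a hypothetical call that relies on these branches (the local path below is made up; passing a single local weights file is exactly the case the branches above handle):

# Hypothetical local-path usage that the elif branches above keep working.
# "./local_bert/tf_model.h5" is a made-up path to a single local weights file;
# the ".index" branch covers TF1-style checkpoints in the same way.
from transformers import BertConfig, TFBertModel

config = BertConfig()
model = TFBertModel.from_pretrained("./local_bert/tf_model.h5", config=config)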

Collaborator

@sgugger sgugger left a comment

Thanks!

@ArthurZucker ArthurZucker merged commit 84c9bf7 into huggingface:main Dec 5, 2022
mpierrau pushed a commit to mpierrau/transformers that referenced this pull request Dec 15, 2022
* add support for `from_pt`

* add tf_flax utility file

* Update src/transformers/modeling_tf_flax_utils.py

Co-authored-by: Sylvain Gugger <[email protected]>

* remove flax related modifications

* add test

* remove FLAX related commits

* fixup

* remove safetensor todos

* revert deletion

Co-authored-by: Sylvain Gugger <[email protected]>


Development

Successfully merging this pull request may close these issues.

Cannot load TensorFlow model from PyTorch weights split to multiple files
