Conversation

@ArthurZucker
Collaborator

What does this PR do?

Adds support for OPT in Flax and TF.
Also cleans up the PyTorch code a bit.

Who can review?

@LysandreJik, @patrickvonplaten, @patil-suraj, @sgugger

younesbelkada and others added 30 commits May 4, 2022 15:18
- putting use cache to False
- remove commented block
- remove unnecessary files
- remove a test file
- added the logits test
- rm mask filling example on docstring
- remove useless args
- more tests should pass now
- needs to clean more
- documentation still needs to be done
- change attention architecture to BART-like
- modify some tests
- style fix
- remove OPT for:
  - QA
  - conditional generation
  - sequence classification

Tokenizers are not implemented
Replaced ``` ...).unsqueeze(``` with ``` >>>).unsqueeze(``` in the docstring example (doctest prompt fix).
jianan-gu and others added 24 commits May 12, 2022 15:30
…izers for Training (huggingface#17154)

* add torch SGD and Adagrad optimizer bits

* refine naming

Co-authored-by: Sylvain Gugger <[email protected]>

Co-authored-by: Sylvain Gugger <[email protected]>
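The commits above wire new optimizer choices into the Trainer via a string flag. A minimal stdlib-only sketch of that dispatch pattern follows; the registry, the names, and the tuple return values are all illustrative stand-ins, not the actual `transformers` API, which instantiates `torch.optim.SGD` / `torch.optim.Adagrad`:

```python
# Hypothetical registry mapping an --optim flag value to an optimizer factory.
OPTIMIZER_REGISTRY = {}

def register_optimizer(name):
    def wrap(factory):
        OPTIMIZER_REGISTRY[name] = factory
        return factory
    return wrap

@register_optimizer("sgd")
def make_sgd(params, lr=0.01):
    # Stand-in for torch.optim.SGD(params, lr=lr)
    return ("sgd", params, lr)

@register_optimizer("adagrad")
def make_adagrad(params, lr=0.01):
    # Stand-in for torch.optim.Adagrad(params, lr=lr)
    return ("adagrad", params, lr)

def create_optimizer(name, params, **kwargs):
    # Resolve the flag to a factory, failing loudly on unknown names.
    if name not in OPTIMIZER_REGISTRY:
        raise ValueError(f"unknown optimizer {name!r}")
    return OPTIMIZER_REGISTRY[name](params, **kwargs)
```

A registry keeps the Trainer code closed to modification: supporting a new optimizer is one decorated factory, not another `if/elif` branch.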
…e_dummy_inputs (huggingface#17105)

* propagate attention_mask dtype

* fixup&style
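Propagating the attention-mask dtype means deriving the mask from the inputs rather than hard-coding a default type. A hypothetical stdlib sketch of the idea (the real code operates on framework tensors, where the element-wise comparison carries the dtype):

```python
def prepare_dummy_inputs(input_ids, pad_token_id=1):
    # Hypothetical sketch: the attention mask is computed element-wise
    # from input_ids, so in a tensor library its dtype follows from the
    # inputs instead of defaulting to a fixed type.
    attention_mask = [
        [0 if token == pad_token_id else 1 for token in row]
        for row in input_ids
    ]
    return {"input_ids": input_ids, "attention_mask": attention_mask}
```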
* Create RetriBERT tests folder

* Add missing RetriBERT tokenizer test file

* Apply style corrections

* Add non-english filter

* Update tests/retribert/test_tokenization_retribert.py

Co-authored-by: SaulLu <[email protected]>

* Update tests/retribert/test_tokenization_retribert.py

Co-authored-by: SaulLu <[email protected]>

* Move test files to new directory

* Update import path for testing utils to new test file structure

Co-authored-by: SaulLu <[email protected]>
…6907)

* add seed worker and set_deterministic_seed_for_cuda function to enforce reproducibility

* change function name to enable determinism, add docstrings, reproducibility support for tf

* change function name to enable_determinism_for_distributed_training

* revert changes in set_seed and call set_seed within enable_full_determinism

* add one position argument for seed_worker function

* add full_determinism flag in training args and call enable_full_determinism when it is true

* add enable_full_determinism to documentation

* apply make fixup after the last commit

* Update src/transformers/training_args.py

Co-authored-by: Sylvain Gugger <[email protected]>

Co-authored-by: Sylvain Gugger <[email protected]>
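The flow described in these commits — one entry point that seeds everything, plus a per-worker seeding hook — can be sketched with the stdlib alone. This is a simplified stand-in: the real `enable_full_determinism` also seeds numpy/torch/tf and switches the backends to deterministic algorithms.

```python
import os
import random

def enable_full_determinism(seed: int) -> None:
    # Seed every RNG source the training loop touches. This stdlib-only
    # sketch covers Python's own sources; the actual helper additionally
    # seeds numpy/torch/tf and flips deterministic-backend flags.
    random.seed(seed)
    os.environ["PYTHONHASHSEED"] = str(seed)

def seed_worker(worker_id: int, base_seed: int) -> None:
    # Each dataloader worker derives a distinct but reproducible seed,
    # so workers don't all produce identical "random" augmentation.
    random.seed(base_seed + worker_id)
```

Calling `enable_full_determinism` twice with the same seed replays the same random stream, which is the reproducibility guarantee the `full_determinism` training flag is after.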
…gface#17166)

* Remove unneeded columns for IterableDataset

* Add test

* Update trainer tests

* Edit docstring

* Lint

* Apply feedback

* Apply feedback
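Because an `IterableDataset` has no column schema that can be dropped up front, unused columns have to be filtered lazily, one example at a time. A hedged sketch of that idea (the names are illustrative, not the Trainer's actual helper):

```python
def remove_unused_columns(examples, signature_columns):
    # Drop keys the model's forward() does not accept, lazily, so the
    # (possibly unbounded) iterable dataset is never materialized.
    for example in examples:
        yield {k: v for k, v in example.items() if k in signature_columns}
```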
* Fix markdown code block

* Use consistent spelling for self-attention

* Fix typos and phrasing

* Fix code style
* Ensure tensors are at least 1d for pad and concat

* Compatibility

* Fix

* Fix

* Add test

* Retrigger CI

* Consistency with master

* Retrigger CI
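The pad/concat fix boils down to promoting 0-d values to 1-d before joining them. A simplified list-based sketch of the idea — the actual fix operates on tensors (where `torch.atleast_1d` does this promotion), not Python lists:

```python
def atleast_1d(x):
    # Wrap 0-d (scalar) values in a length-1 list so that padding and
    # concatenation treat every input uniformly; lists pass through.
    return x if isinstance(x, list) else [x]

def concat(a, b):
    # Promoting both sides first means a scalar metric and a batch of
    # metrics can be joined without a special case.
    return atleast_1d(a) + atleast_1d(b)
```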
* [WIP] Add FLAVA model

This PR aims to add the [FLAVA](https://arxiv.org/abs/2112.04482) model to the transformers repo.

The following checklist delineates what needs to be done for this PR
to be complete:

- [x] Flava init
- [x] Flava base models
- [x] Flava layers
- [x] Flava Configs
- [x] Flava encoders
- [x] Flava pretraining models
- [ ] Flava classification/retrieval models (to be added in a separate PR)
- [x] Documentation updates
- [x] Imports updates
- [x] Argstring updates
- [x] Flava pretrained checkpoints
- [x] Flava tests
- [x] Flava processors
- [x] Sanity check
- [x] Lint
…16922)

* adding philosophy.mdx translation to Spanish

* adding philosophy.mdx translation to Spanish

* Update docs/source/es/philosophy.mdx

Co-authored-by: Omar U. Espejel <[email protected]>

* philosophy translation to Spanish

* Update _toctree.yml

* Update _toctree.yml

* nits

Co-authored-by: Omar U. Espejel <[email protected]>
* Spanish version of language_modeling.mdx doc file

* modification to toctree.yml file

* Update docs/source/es/language_modeling.mdx

Co-authored-by: Omar U. Espejel <[email protected]>

* Correct position of Guías conceptuales

Co-authored-by: Omar U. Espejel <[email protected]>
…e#16882)

* Spanish translation of fast_tokenizers.mdx

* add fast_tokenizers to the spanish _toctree.yml

* Update docs/source/es/fast_tokenizers.mdx

Co-authored-by: Omar U. Espejel <[email protected]>

Co-authored-by: Omar U. Espejel <[email protected]>
…xamples (huggingface#16685)

* Change nits in Spanish for quicktour.mdx

- Add tasks names in English too.
- Fix small nits in Spanish

* Translate index.mdx to Spanish

* Translate body of index.
* Translated the compatible-models list (not the papers' names). Since this should not be updated manually, I can come back to the original text.

* Add models and a dataset for Spanish in the code examples

* Replaced the English models with Spanish versions.

* Add index to _toctree.yml and fix Spanish

* Fix doubled “ character error

* Change negative example in ASR example

* make style

* Debug style in quicktour.mdx
* Fix contents in index.mdx to match docs' sidebar

* Eliminates api section from contents
@ArthurZucker
Collaborator Author

Closing in favor of a new pull request where the history is fixed

@ArthurZucker ArthurZucker deleted the opt-tf-flax branch May 13, 2022 07:35
@ArthurZucker ArthurZucker restored the opt-tf-flax branch May 13, 2022 07:37
@ArthurZucker ArthurZucker deleted the opt-tf-flax branch May 13, 2022 07:40